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PREFACE 


A decade ago we began working on the problem of analyzing psycho- 
therapeutic interviews. As time passed and as solutions to our 
research problems emerged, we began to realize the solutions had 
general implications. Going from the specific to the general carries 
within it the danger of tunnel vision but sometimes has the advantage 
of conferring uniqueness of perspective. It also has the realistic 
advantage of being grounded in actual research problems. 

Іп our view a fundamental emphasis in the following essay is the 
insistence on faithful representation of subject behavior over periods 
of observation. This may be contrasted with the usual transforma- 
tion of behavior to scores in which information is discarded and in 
which one draws the first step away from actuality. 

А second fundamental emphasis is on analyzing the representation 
of behavior with a minimum of assumptions. Although powerful 
and sophisticated mathematical techniques related to those discussed 
in this work exist, they seemed to us to require assumptions we were 
unwilling to make or to require analysis of but a fraction of the data. 
Thus the advantages of the use of distribution functions and of sig- 
nificance levels were, for us, offset and we were persuaded to follow 
our own path. 

As we proceeded in our search for a solution to our problems, the 
analysis of naturalistic data came to be viewed as a kind of factor 
analysis of behavior representations rather than of scores. If it is 
a truism that one gets out of factor analysis what one puts into it, 
it is also true (if not a truism) that factor analysis often informs our 
understanding of what actually was put into the mill of routine 
eventuating in what is symbolized by “factors.” It is so with rep- 
resentations of behavior. What amounts to a kind of factor analysis 
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results in classes of behavior which may appropriately be called 
“factors.” These mathematical partitions may serve as a ground 
for hypothesis formation, psychological induction and further, more 
refined study. 

We conceive of the process of naturalistic observation and analysis 
as involying observation, the representation of the results of observa- 
tion, analysis of the representation and the creation of a new frame- 
work for observation and further analysis culminating finally in 
rather formal hypotheses, theories and controlled experimentation. 
Methodologically our essay may be characterized as factor analytic 
or Thurstonian; logically, as Baconian; and procedurally as Darwinian 
or naturalistic. 

The presentation presupposes an understanding of factor analysis 
in the tradition established by L. L. Thurstone and an understanding 
of matrices and vectors covered by a substantial portion of, say, the 
first three chapters of S. Perlis’ Theory of Matrices. Nearly all of 
the mathematical statements are very common currency, so it was 
considered unnecessary to offer much in the way of mathematical 
proof and development. No mathematical buttressing of the point 
of view offered is intended or given. For our purposes the mathe- 
matics was subordinate and had the status of hammers, screwdrivers 
and other homely but necessary tools. 

Although our use of the methods to be described has been limited 
to the analysis of psychotherapeutie interviews, we can envisage 
other uses by clinical and counseling psychologists, social psycholo- 
gists, content analysts, comparative psychologists observing organ- 
isms in relatively free situations and, perhaps, taxonomists. 

The authors have been most deeply influenced by some of the 
illuminating early papers of Professor Louis Guttman, by the factor 
analytic approach of the late Professor L. L. Thurstone and by 
the concept of identifiability enunciated by Professor Tjalling 
Koopmans. The list of references, with the exception of the refer- 
ences to latent class analysis, is limited to the work directly influ- 
encing our thinking. 

We wish to express our appreciation to the Кога Foundation for 
allocating funds making our investigations possible, to thank Mr. L. 
Harmon Hook for the mathematical aid and advice he so generously 
afforded us and to thank Professor Jack Sawyer for careful and 
critical reading of two manuscript versions. 

We are also indebted to the American Psychological Association 
for permission to reprint the discussion on pages 178—184 and the 
tables on pages 196—198 of Volume II of Rescarch іп Psychotherapy, 
edited by Drs. Lester Luborsky and Hans H. Strupp. 
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CHAPTER ONE 


INTRODUCTION 


When one considers the philosophy and teaching of science, it seems 
significant that little, if any, formal attention is paid to naturalistic 
observation and rescarch, although it is quite clear that they consti- 
tute the foundation of all science. The science of physics is pro- 
foundly indebted to the uncontrolled observations of superstitious 
Babylonian and Chaldean priests. Recently we have witnessed 
widespread observances of the centennial of the masterwork of 
naturalistic observation and research, the Darwinian theory of 
evolution, which is still revolutionizing our concept of the biological 
world and, indeed, of ourselves. Yet in many current discussions of 
science by philosophers or by scientists in the role of philosophers, we 
find a great deal of emphasis placed upon what are considered to be 
the two major aspects of science: experimental method and logic, and 
the personal element, the latter being discussed under such headings 
as "genius", “ereativity’”, “scientific imagination", and “luck”. 
Although scientific method is greatly admired, it seems that the 
practitioner of scientific method, the experimental scientist, is not. 
The skilled experimentalist at best attains second rank as a scientist ; 
the highest rank he can hope to attain is that of Boyle. The deepest 
admiration is reserved for the Newtons, Einsteins, Darwins, and 
Flemings who are considered to be geniuses or to be lucky discoverers 
able to capitalise on the significance discerned in accidental events. 
Understanding science by division into the two aspects of objective 
method or verification and personal subjective elements creates a 
rather uncomfortable situation for the majority of scientists who do 
not possess genius and who realise that luck is a synonym for chance. 
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The sense of discomfort increases if one is in a borderline science, 
if one is exploring a field in which basic and fruitful concepts like 
those of Einstein, of Newton, and of Darwin have not been formu- 
lated; where experimentation, as in certain areas of psychology, seems 
to be reduced to empty formalism because positive knowledge does 
not exist which tells us what should be controlled. 

When a scientist enters into an inquiry in a new domain, his ideas, 
however precise as ideas, are hazy with respect to the domain simply 
because it is an unexplored area. When the phenomena under con- 
sideration are not described well, when relationships at the phenom- 
enal level are not known, the person who insists on starting out with 
precise experiments lacks the imagination required to observe, to 
look around and see, at the first level, what seems to be going on. 
Experiments which precede rather than succeed observation amount 
to being precise about vagueness and suggest a disproportionate con- 
cern for the opinions of other scientists with a disproportionate lack 
of concern for truth which, above all, should characterize the scientist. 
It is obvious that the scientist who wishes to bring a particular domain 
under the control of scientific law should attempt to observe and to 
arrive at preliminary hypotheses or systems of hypotheses on the 
basis of his observations. His ultimate aim, of course, should be 
identical to that of any other scientist, namely, to generate sets of 
propositions which, at crucial points, are verifiable through the proc- 
esses of scientific method. His immediate aim then should be to arrive 
at fruitful hypotheses. 

In our present stage of understanding science the creation of 
fruitful hypotheses must await the moment of insight of the genius 
or the lucky event which only the genius appreciates. Nevertheless, 
as psychological experimentation has shown quite clearly, the ap- 
propriate arrangement of situations can lead to insight into situations 
or can retard and prevent the occurrence of insight. The aim here 
will be to describe an approach to data of naturalistic observation 
which, it is hoped, will encourage the development of fruitful hypoth- 
eses with respect to the domain of inquiry. 


NATURALISTIC OBSERVATION 


As conceived here, the naturalistic observer approaches his domain 
of inquiry, considered to be some behavioral situation, with several 
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sets of convictions, beliefs, frames of reference, biases, prejudices, call 
them what you will, which will hereafter be called a classification 
system. To a first approximation, at least, the observer knows what 
behavior is. That is, he has a definition, explicit or implicit, of what 
behavior is, and he knows in a preliminary way what behavior he is 
interested in observing. Suppose that he observes a single organism’s 
behavior on a series of occasions and that each occasion covers a 
period of time such that several behaviors are observed. Then the 
observer has collected two sequences of observations, the sequence 
within occasions, and the sequence of occasions. It seems to be a 
reasonable assumption that if the observer is to formulate some hy- 
potheses based on the organism’s behavior, he will consider the 
behavior within occasions. Апа he will consider not only what does 
occur, but what does not occur that could be expected to occur. 
Also, he will note the sequence of various actions, or the absence of 
such a sequence, at each point in the course of the particular period 
of behavioral observation. Such considerations might lead to con- 
clusions or hypotheses about what might be called the behavioral 
structure of an occasion. But there is also a sequence of occasions. 
Therefore, there will be a set of occasions, each having its own struc- 
ture. The set of occasions constitute the data of our naturalist. 
The naturalist undoubtedly takes into account not only the sequence 
of occasions, but tries to see relationships among the behaviors over 
the whole set of occasions without regard to the structure of each 
oceasion. If the naturalist is observing many behaviors over many 
occasions, his observations can aggregate to an impressive number, a 
number which can easily aggregate to an overwhelming amount as 
the number of organisms observed increases. 

Suppose what probably is actually seldom the case, that the cate- 
gories of observations of the naturalist are mutually exclusive. Were 
this so, it could and will later be shown that the nature of classi- 
fication systems is such that it might well appear to the observer that 
there is order in the data in the sense that scientists think of order as 
of "underlying relations." Also, from well known experimentation 
in psychology on memory, learning, and forgetting, it is clear that 
the classification system of the observer may change continually as 
Observation proceeds, so that the very process of observation may be, 
and really always is, conditioned by the preceding observations. 
Thus, the very process of data formation, from one point of view, 
changes. From another point of view it can be said that the classi- 
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fication system of the observer is continually conditioned by the 
observations. Thus we see that all is in flux, the observer has à 
frame of reference which is not fixed, which can change as a result 
of the very process of observation. Г urthermore, what we know 
about memory indicates that additional changes in the frame of 
reference, and of the data conceived as the ordering of observation 
into the frame of reference of the observer, come about as a function 
of time. Bartlett's fascinating study of remembering indicates, that 
upon reflection after the events, the inner nature of the events seems 
to have changed. 

Consider that naturalistic observer, the psychotherapist. Essen- 
tially, he observes the communicative or linguistic behavior of a 
patient or group of patients over periods of perhaps hundreds of 
therapy hours. Therefore, even were his classification sub-classes 
few in number and mutually exclusive, the observations would 
amount to a really overwhelming array of data. Even when the 
psychotherapist takes notes, no matter how voluminous, they are, 
at best, aids to memory. The therapist, a Freud for example, then 
must remember over long periods of time and in terms of a large 
number of patients, what happened, what did not happen, ete., in 
addition to considering information not obtained in the therapy 
hours. Then he creates on what he conceives to be his observations, 
a theory of psychotherapy and a psychological theory of personality. 
No matter how plausible and convincing these theories may be, it is 
clear that the interaction of observer and observed, the personal or 
subjective element, can and probably does play a large part in the 
creation of the theory – and in such a way that we do not know to 
what extent the theories describe the theorist or the patients. 

It would indeed be a humbling and corrective experience for а 
psychotherapist to have copies of the original situations, such as 
sound recording or sound motion pictures, played back after a case 
description and analysis. For he would undoubtedly discover that, 
while there.were “essential truths" in his formulation, what was de- 
scribed as having a dramatic unity had, in fact, far less unity, and 
was much more scattered throughout the observations than his 
formulations indicated. And he would also find that what on listen- 
ing or viewing seemed to be important events, had been neglected 
in his formulation. In other words, faithful copies of the events lead, 
upon presentation to the original observer, to a change in the classi- 
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fication system or hypotheses consequent upon his original observa- 
tion. Even were the psychotherapist unbiased or uninvolved, the 
case would be the same because of the large number of varied re- 
sponses displayed by his patient. 

The considerations outlined above suggest the function and prob- 
lems of naturalistic observation. The function of naturalistic ob- 
servation is to generate hypotheses concerning the underlying struc- 
ture of behaviors, if any; it is not to test hypotheses. The testing of 
hypotheses is in the province of experimental methodology and, in 
certain situations, of statistical methodology. Generally speaking, 
the use of experimental method depends upon control, and by control 
is meant the control of relevant conditions or variables. Seldom, if 
ever, can everything be controlled. Thus, the use of experimental 
method depends upon previous positive knowledge or upon insight 
gained in uncontrolled situations. But uncontrolled situations are 
just those covered by naturalistic observation. Hence, if we can 
improve naturalistic observation, we are more likely to be in a position 
to have insight into what should be controlled in advancing an ex- 
perimental expedition into the domain of inquiry. 

The function of naturalistic observation being the generation of 
hypotheses, one basic problem posed by such observation is immedi- 
ately apparent. It is what might be called objectifying the observa- 
tions or moving from the act of observation to the recording of data. 
The recording of data is thought of as obtaining a record of behavior 
in symbolic form; getting such data implies: 

1. a standard definition of what constitutes a response. 


2. a classification scheme or system into which responses are 
placed as they occur. 


In what follows it will be assumed that the investigator has a stand- 
ard and specifiable definition of a response. 

The function of the classification system in naturalistic observa- 
tion and analysis is to provide an objective record of behavior such 
that, once the period of observation has ended, one сап tell with 
certainty what was observed. Therefore, the classification should be 
objective in the sense that two observers would essentially classify 
behavior the same way; the classification scheme should be specifiable 
and reliable. To the extent that it is not, the classification system is 
personal and unreliable. Secondly, the classification system should 
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have mutually exclusive sub-categories unless the classifying behavior 
of the investigator is the object of investigation. When the classes 
are not mutually exclusive, cross-classifieation may be dependent; 
some of the possible cross-classifications will be mere artifacts. 

It seems that reliability and mutually exclusive sub-classes repre- 
sent minimal characteristics of a satisfactory classification system. 
A reliable classification system with mutually exclusive classes would 
provide an objective record of behavior over a series of observations 
and would push the problems of subjectivity and insight back where 
they belong, namely, into the creation of the classification system 
itself and to the generation of hypotheses based upon analysis of the 
classified behavior. Since the number of classified behaviors or data 
may be very large, it seems that an additional criterion of a min- 
imally satisfactory classification system would be that it be sus- 
ceptible to objective analysis, preferably analysis of a type which 
treats the classification system and the data as a unit. 

The three criteria of a minimally satisfactory classification system 
by no means exhaust the problems inherent in classifying behavior. 
For it is clear that an arbitrary number of classification systems can 
be applied in the same observational situation. And the same applies 
to the definitions of response. When we have two different universes 
of discourse, connected, possibly, only by a standard definition of 
response, two criteria of the relative validity of the systems might be 
called internal productivity and external productivity; internal 
productivity being some index of the amount of order in the behav- 
ioral data and external productivity being some index or веб of indices 
indicating relationships between the behavioral data and other data. 
А somewhat analogous distinction is made in mental test theory 
between the kind of validity described by the correlation between а 
test and the factor (or factors) it represents and the kind of validity 
described by the correlation between a test and an external variable. 
Given two classification Systems, the first having more internal and 
external productivity than the second, it seems clear that the first is 
preferable to the second. It should be noted, however, that onc 
System might have more internal and less external productivity than 
the other and vice versa; the two criteria might not point to the 
same decision. 

A further distinetion to be made concerns the "grain" of the 
classification systems. А fine-grained system is one with relatively 
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many sub-classes and a coarse-grained system, one with relatively 
few (and as few as two) categories. A system with only two sub- 
classes has, in effect, only one, since a two-class system can describe, 
at most, only one kind of organism-environment transaction, all 
other kinds being lumped together in the remainder class. Thus the 
two-class system represents the ultimate in coarse-grained systems 
for, of course, a one-class system is completely determined. A com- 
pletely determined system can yield no information. Since the 
behavior of organisms is complex, it would seem that a three-class 
system, and other relatively coarse-grained systems would be less 
likely to be productive, other things being equal, than relatively 
fine-grained systems. Fine-grained systems are less likely to have 
responses classified into the remainder class. 

In coarse-grained systems, behavior is likely to be classified into 
the remainder class because attention is centered on a few kinds of 
behavior; in fine-grained systems, however, such classifications are 
likely to be made because of inability to identify the class into which 
to order the response. In such a case the class “unclassifiable” be- 
comes a sub-class of the remainder class. In actual investigations 
it is probably desirable to record, in addition to the explicitly defined 
categories, the unclassifiable and remainder classes, for they will often 
be of considerable interest and could yield valuable information. 

А criterion related to that discussed above is exhaustiveness. Ап 
exhaustive system is one in which all responses are classified into 
explicit categories. It seems reasonable to suppose that the more 
exhaustive a system is, the more productive it is likely to be. 

In general then a satisfactory classification. scheme will have 
mutually exclusive sub-classes, be susceptible to routine analysis, 
and possess internal and external productivity. Productivity is in 
turn likely to be dependent upon “grain” and exhaustiveness. These 
rather rough, ready, and commonsensieal criteria are somewhat 
vague and lack the sense of closure obtained by means of a complete 
intellectual scheme. However, they do seem to be serviceable criteria 
and can be used to some extent for developing a classification system 
before observation formally begins. Some of the criteria, especially 
productivity, will be delineated more sharply later. 

One should remember that the behavior being considered here is 
completely uncontroled by the investigator. Nevertheless, the 
classification system is somewhat analogous to a control of behavior 
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in the sense that the observer insists on observing the behavior in 
terms of the classification system employed. Thus the observer is 
controlling his experience, as in an experiment, but, perhaps, in an 
undesirable manner. That is, his classification scheme may simply 
be an objectified set of prejudices having much to do with the mis- 
guided imagination of the observer but containing no insight about 
the kind of organism observed. In this case, it is likely that analysis 
of the data will reveal only the folly of the originator of the classifica- 
tion system. A properly conceived and executed experiment, how- 
ever, does not fail to yield real information. This is in the very 
nature of the process of experimentation. Therefore, the naturalistic 
observer must be as carefully imaginative as possible in constructing 
his classification system; he must fuse his creativity and experience 
for the purpose of constructing a more than minimally satisfactory 
classification system. 

The desirability of extreme care and maximal creativity in con- 
structing a classification system can be seen by considering a situation 
in which behavior is physically controlled by limiting the behavior 
of the organism to series of finite alternatives. Тһе maze in which 
the organism is restricted to right and left turn behavior is a good 
example; so are Fabrés caterpillars endlessly circling the post on 
which he placed them. Such simple situations, hardly to be conceived 
ав constituting full experimental control, have yielded a wealth of 
information about response acquisition and instinctive behavior. 
Now in the maze, for example, the alternatives ean be considered as & 
two-class classification system and analysis of the sequence of alter- 
natives expressed in the behavior of the organism can be handled, as 
will be shown later, in the same manner as with the classification 
systems which do not imply the imposition of physical limitations 
upon the behavior of the organisms being observed. To make the 
point in a rather fanciful manner, when the organism is merely ob- 
served, the organism "selects" as behavioral alternatives the sub- 
classes of the observer's classification system. When there is more 
control, choices are forced upon the organism by environmental con- 
ditions imposed by the investigator. In neither case is it remarkable 
that a “choice” is made. In the first case, however, “selection of 
alternatives” may have little to do with the organism observed and 
much to do with the observer. In the second case, selection of alter- 
natives actually represents transactions however “artificial” the en- 
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vironment may be. In other words, in the second case the behavior 
represents an actual organism-environment transaction. In the first 
case the classified behavior may show only that behavior occurred; 
the classifications may not represent any actual transaction between 
organism and environment. This is the defect of uncontrolled ob- 
servation; it may not be observation of more than the fact that re- 
sponses occurred a given number of times; hence the necessity for the 
basic criterion of productivity. For productivity guards against 
just the possibility that a given classification system is irrelevant to a 
given group of organism-environment transactions or responses; it is 
here conceived that an irrelevant but specifiable and reliable classi- 
fication system would, for a given set of responses, be unproductive; 
that is to say, an irrelevant system would result in data exhibiting 
no order, 


SUMMARY 


Exploratory investigations are necessary, and are most likely to be 
useful, when fruitful and basie concepts do not exist with respect to 
the domain of inquiry. Exploratory investigation almost necessarily 
starts with the expectation that underlying the manifold events in a 
domain of inquiry is an order; that a set of constructs will unify and 
simplify comprehension of a class of phenomena. When the scientific 
knowledge or the insight necessary for controlled experimentation is 
not at hand, the investigator can use naturalistic observations of the 
domain as a means of increasing his insight. By this means he may 
successively extend his insights to the point where controlled ex- 
perimentation becomes possible and desirable. 

In the behavioral domain of inquiry, naturalistic observation 
requires a definition of response and the application of a classification 
system to the responses observed. The classification system must, 
in order to be applied, have mutually exclusive categories and be 
reliable, capable of yielding the same results upon re-classification by 
а second observer. Since a given classification system cannot be 
considered to be unique, the problem of the worth or validity of the 
system arises. Worth (or validity) can be assessed in terms of inter- 
nal and external productivity. External productivity cannot exist 
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unless internal productivity also exists. The converse is not true. 
Productivity is more likely when classification systems are relatively 
fine-grained and exhaustive. Finally, а preferred classification 
system is one so formulated that its data, the behavior described in 
terms of the system, can be analyzed in an objective fashion, the 
analysis preferably being of the data considered as 2 whole. 

A naturalistic investigation is concluded when analysis of the data 
results in a demonstration that the system is unproductive or pro- 
ductive as the case may be. If the system is productive the end 
result should be a sharpened and refined set of hypotheses leading to 
a better classification system with which to approach the domain. 
Or it may result in a set of hypotheses so refined as to indicate what is 
to be controlled and varied in an experimental attack upon the 
domain of inquiry. 

The nature of exploratory investigation being what it is, it starts 
and ends with personal creation. Usually personal creation, the sub- 
jective element, pervades the entire process, often to the detriment 
of the investigation. The personal, subjective clement should be 
confined to the beginning of the investigation when the classification 
system is being devised and made explicit and to the end when the 
routines of analysis have provided the formal results upon which 
personal creativity should operate. 


CHAPTER TWO 


THE ANALYSIS OF EMPIRICAL 
CLASSIFICATION SYSTEMS 


The purpose of a naturalistic analysis of behavior data is to find a 
structure of relations among the data serving as a basis for hypothesis 
formation. More accurately perhaps, one should speak of hypothe- 
sis reformulation since the classification system itself is a system of 
hypotheses. Prerequisite to finding the structure of relations is the 
representation of the classified data and the assessment of internal 
productivity. When a classification system has been judged to be 
internally productive, one then proceeds to ascertaining the structure 
of relations in the data. In practice, assessment of internal produc- 
tivity and determination of structural relations may proceed simul- 
taneously. However, the former is logically prior to the latter. 
Logically, the manner in which data is represented is а matter of 
choice. But logie is not science and it is only logically speaking 
that one ean be arbitrary. In the behavioral domain, data comes 
in sequence; responses are made one after the other. Thus, if an 
individual organism is busily making responses, to each of which a 
number of some sort сап be attached; then the organism in mathe- 
matical terms can be considered to be engaged in constructing an 
ordered n-tuple of numbers. An ordered n-tuple is a vector and 
the particular behavior vector for a particular organism is then a 
representation of the behavior of the organism. In the point of 
view taken here, the data of the investigator are the behavior vectors 
and the data are representations of acts of observation within a 
classification system. Parenthetically, it is our belief that all acts 
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of observation are embedded in a classification system and that the 
concept of observation and the concept of classification are indivisible. 

Basic, then to the problem of ascertaining structural relations 
among data is the problem of coding data in such a manner that the 
coding represents behavior as it occurs, The representation must 
show not only what occurred but when it occurred. This prescrip- 
tion rules out many schemes of representation of behavior such as 
the conventional item-score matrix of mental test theory and the 
representation of so-called response patterns by vectors which are 
either identical or have zero inner products, i.c., are orthogonal. 
Such representations may be formally impeccable but materially 
represent decisions to ignore part of the behavioral data in some 
way. Such decisions should have more than a formal basis. In 
апу event, such decisions represent some kind of operation upon 
the data. 


ARRANGEMENT OF DATA 


Given a classification system with q mutually exclusive categories 
of responses applied to { organisms over n responses, with a unit 
entry denoting the oceurrence of response number m into category 
number p for organism j, the classified behavior of the jth organism 
can be represented as a column matrix 
8; = col (0,0,..., І, 2:05:10: 
0,...0.,0,0,...,0,0...,1) (1) 
where there are п unit entries, one for each response, and nq — 1) 
zero entries; that is, a total of 74 YOWS. A fictitious data matrix 
where n = 8, q = 2, and t = 25 is shown in Table 1. This fictitious 
data matrix shows just what responses were observed as being made 
and when, in the sense of succession, by each subject. Such a matrix 
is the result of observing and classifying the behavior of 25 subjects, 
the behavior being regarded as discriminable into eight responses and 
the responses being regarded as discriminable into two sub-classes. 
The data matrix in the form shown gives the “natural history” of the 
behavior exhibited by the subjects over the period of observation. 
The data matrix S will be said to represent the behavior in funda- 
mental form. It may be remarked that other representations of 
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TABLE 1 


Fictitious Data Matrix 8 
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Subjects (t 
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behavior classified into a finite number of mutually exclusive cate- 
gories could be used. However, all such classifications can be brought 
into fundamental form. More precisely, the other representations, 
when put into matrix form, can be expressed as lincar combinations 
of the vectors of S, but the matrices derived from other modes of 
representation will not yield the vectors of S. This is precisely why 
the S matrix is referred to as being in fundamental form. 


ASSESSMENT OF INTERNAL PRODUCTIVITY 


When the behavior of subjects has been coded and arranged in the 
form of the 5 matrix, the considerations outlined in the first chapter 
can be expressed in a more exact manner. For example, internal 
productivity can be assessed in terms of the randomness (or lack of it) 
of the distribution of responses. The chi-square statistic and its 
associated probability for a given level of significance may be regarded 
as a measure of internal productivity. The chi-square statistic 18, 
however, a rather unsatisfactory measure of internal productivity. 
A more satisfactory measure is the ratio of the actual rank of S to the 
maximal possible rank. The rank of S will be less than the possible 
rank of S only when some of the rows or columns are identical. Such 
a situation will obtain only when the behavior of the subjects exhibits 
marked order or consistency and it is the order or consistency of re- 
sponse which indicates that the classification system is internally pro- 
ductive. The maximal possible rank of an ng ЖЕ S matrix represent- 
ing the classified behavior of a group of subjects will be (1 — 1/q)ng + 
1 or, equivalently, nq — n + 1. Each sub-matrix Sm, correspond- 
ing to the mth response, nm, will have rank q, since for each response 
the sub-classes of the classification system are required to be mutually 
exclusive and each row vector of S represents one sub-class of the 
system. The requirement of mutual exclusivity of classification 18 
shown in S,, by a set of 4 row vectors having zero inner products. A 
zero inner product thus indicates that the row vectors have no unit 
entries in common. Sub-matrix Sn may be regarded as defining а 
Space of q dimensions containing a vector Y, with unit entries only 
whether or not it appears empirically; for the sum of the q rows of S» 
is just the vector Y. There is one matrix S, corresponding to each 
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of the » responses. Therefore the sub-spaces defined by each of the 
responses have the Y vector in common. Since the rank of any 
ng X t matrix where ng > tis at most nq, it follows directly that the 
rank of an nq X tS matrix is at most nq — п + 1. The maximal 
possible rank of S, па — n + 1 is the mathematical counterpart 
of the requirement that the sub-classes of the classifieation sys- 
tem be mutually exclusive. If, for a given set of data, the actual 
rank of S is r, then the expression r/(nq — n + 1), the rank differ- 
ential, which may vary between 1/(ng — n + 1) and unity, may 
be used as one measure for assessing internal productivity. The 
value of 7/(па — n + 1) will be less than unity only when some 
rows of S have identical placement of unit entries and when some 
rows have zero entries only. In this case a “preferential selection" 
of sub-classes of the classification system has been shown by the sub- 
jects. In a sense the “amount” of preferential selection is given by 
r/(nq — n + 1), the rank differential, and the "kind" by the behav- 
ior sub-classes which are identical. When a maximum degree of 
“preferential behavior" is shown by the subjects, their behavior is 
maximally consistent; the rank of S will be unity and the behavior 
of all subjects will be identical. When the behavior of sub-groups 
of subjects is identical within sub-groups, the behavior of each of the 
groups being different, the corresponding S matrix will show r dis- 
tinct groups of row (or column) vectors. 'The veetors of the ith 
group will be identical, and the inner products of the vectors in the 
ith and the jth groups respectively will be zero. If r vectors, one 
from each group, are selected, the ғ vectors form an orthogonal basis 
for the r-dimensional vector space defined by S and all subjects can 
be discriminated or classified perfectly in terms of the r vectors so 
chosen. Maximum discrimination in classification is then achieved 
when, for rank r of S, r orthogonal row vectors of S exist. It should 
be noted that for a classification system with 0 mutually exclusive 
sub-classes, there сап be по more than 4 orthogonal row vectors in S. 
Thus, maximal internal productivity for a given classification system 
implies that the associated S matrix has rank of at most 4 and that 
there be at most q orthogonal row or column vectors in S. Many 
different sets of r orthogonal row or column vectors of S may exist: 
Le., there are many different arrangements. When the rank of S is 
greater than 4, the data will be said to have an oblique structure іп 
the sense that, given a basis of empirical data vectors, at least r — q 
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of the vectors of the basis must have non-zero inner products with 
each other and with the 4 orthogonal vectors. The non-zero inner 
products indicate less than maximum discrimination between empiri- 
cal classes of behavior and, consequently, less than optimum internal 
productivity of the classification system. Internal productivity of 
the classification system 1s thus related not only to reduction in rank (the 
rank differential) but to the inner products of vectors from S chosen as а 
basis. "The higher the inner products between the unit basis vectors, 
the less internally productive the classification. Nevertheless, а 
given S matrix may have rank greater than 4 thus having an oblique 
data structure, but have high, though less than maximal, internal 
productivity. 

Implicit in the above discussion is the restriction of the basis to 
empirical data vectors taken from S. The reason for the restriction 
is that the classes of behavior represented by the row vectors of 5 
chosen for the basis are empirical classes and the structure of the data 
can then be described in terms of actually observed classes of behav- 
ior. To this restriction should be added the restriction that the co- 
ordinates of all data vectors be non-negative. These restrictions 
amount to confining the universe of discourse to what might be called 
the behavior space, which, as Table | clearly shows, must have like- 
signed entries when behavioral events of specified kinds are regarded 
as occurring or not occurring. 

The restriction that unit length basis vectors be normalizations 
of data vectors and that the coordinates of the data vectors be non- 
negative results in a unique orthogonal basis when the rank, r, of S 
is equal to or less than 7. This is the case in which there are r groups 
of vectors; within each group the vectors are identical and have 
identical inner products. ‘The inner product of any two vectors in 
the ith and jth groups is zero; the sub-classes of response do not inter- 
sect. Any change of basis then results in a basis such that the matrix 
of coordinates of the data vectors contains negative entries which 
cannot be interpreted in behavioral terms. It seems then that the 
restriction to non-negative coordinates is not merely arbitrary. 
When r > q, the data structure must be oblique in the sense that 
r — q of the basis vectors must have non-zero inner products and the 
obliqueness of the structure reflects less than maximal internal pro- 
ductivity in the classification System. That is, the discrimination 
among subjects in terms of the sub-classes of behavior is less than 
optimal. 
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When a data basis is oblique, complex problems arise in interpre- 
tation of the matrix of coordinates of the data vectors because of the 
non-zero inner products of the basis vectors which indicate that ambi- 
guities are present in the data and in the classification system. Thus 
the process of hypothesis formation, based on regarding the basis 
vectors as being generators of the data, becomes more complex. One 
way to circumvent this difficulty is to use the oblique data basis to 
define an orthogonal basis which "best" fits the data basis. Then 
the behavior classes defined by the vectors of the orthogonal basis do 
not intersect and are independent. The orthogonal basis represents 
а non-empirical set of behavior classes strictly determined by the 
oblique basis and “most similar” to it. Depending on the actual 
goodness of fit, the matrix of coordinates of the data vectors expressed 
in the orthogonal basis will be similar to the matrix of coordinates 
expressing the data vectors in the oblique basis. The similarity in- 
sures that similar hypotheses will emerge from the consideration of 
each matrix and makes it possible to avoid the complex problems 
inherent in considering the relations between intersecting or overlap- 
ping behavior classes. It is perhaps unnecessary to mention that the 
configuration of data vectors is not changed by the change of basis. 
Hence accuracy is sacrificed for simplicity and clarity when a best fit- 
ting orthogonal basis is used for hypothesis formation in place of its 
corresponding oblique basis. Fortunately this is more а theoretical 
than a practical problem; generally the two matrices of coordinates 
will be so similar as to lead to similar interpretations. If, however, 
the data basis is oblique, and if the investigator can use the non-zero 
inner products in the process of hypothesis formation, he should by 
all means do so, for the data basis best represents the data for the 


Purposes of inference. 


THE BEST DATA BASIS 


Many data bases can be derived from an S matrix, the vectors of 
which have an oblique structure; when the configuration of vectors is 
orthogonal, then the basis, under the restrictions previously stated, is 
unique. The unique orthogonal basis corresponds to perfect dis- 
crimination among subjects by the classification system and repre- 
sents maximal internal productivity. Thus an orthogonal data basis 
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is the best data basis since it corresponds to maximal internal produc- 
tivity of the classification system. In general, however, the rank of 9 
will exceed 4 and then the vectors of any data basis will have non- 
zero inner products, and the system will have less than maximal inter- 
nal productivity. 

The question can now be asked: Which among the several possible 
oblique data bases is the best? А preliminary answer to this question 
is that the coordinates of the data vectors of S be non-negative- 
Such a data basis corresponds to the intuitive notion of “outside” or 
“boundary” vectors. Given a set of data vectors with cosines less 
than unity and lying in a two-space, it is clear there is just one data 
basis for which the coordinates of the data vectors are non-negative 
and that this basis is a basis of “outside” vectors. However, it is not 
clear that in a higher dimensional space, there is just one basis of 
boundary vectors except in the case where the data basis is orthog- 
onal. In the latter case the basis vectors are boundary vectors П0 
matter what the dimensionality. Taken at unit length the orthogonal 
basis vectors comprise a collection of vectors B, such that the zero 
vector alone is orthogonal to the members of B. Such a collection 
is called orthonormal complete: an orthonormal complete set of vec- 
tors is always a basis for B and the members of the orthonormal com- 
plete set always have inner products of unity or zero. That the 
vectors of В are an orthonormal complete set implies that В is not 
contained in a larger set with members orthogonal to the members 
of B, for B is so defined that the zero vector alone is orthogonal to its 
members. Апа the zero vector has zero length so is not a member 
of B. Since the coordinates of the data vectors on the members of 
the orthonormal complete set, which form a basis for the vectors of S; 
are inner products, it follows that the sum of squares of the coordi- 
nates* for any vector is the self inner product of that vector and that 


= 1.00 (2) 


lii 

тт 

where a;; and h; are the coordinates and the length of the ith 
vector respectively. 

It can be shown that a value of unity for c is a criterion for ortho- 

normal completeness, so when c is unity the vectors under considera- 


*Coordinates are here defined as parallel projections of vectors on basis vectors. 
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tion are the veetors of an orthonormal basis. If c is less than unity 
for r linearly independent data vectors in a space of dimension 7, the 
r basis vectors are not orthogonal. If S is of rank т with c less than 
unity for all bases with unit length vectors taken from 5, the struc- 
ture of S is such that no orthogonal data basis exists. From (2) it is 
clear that for every data basis there is a value of с; the largest value 
of c is here taken to define the “most nearly orthogonal basis.” The 
more closely orthogonality is approximated, the less the intersection 
or overlap of the classes formed by the data vectors. In a collection 
of values of c, the largest value identifies the basis of vectors which 
corresponds to the intuitive notion of boundary or outside vectors. 
While it may be that given two values of c, с, and cj, the coordinates 
of the data vectors expressed in the two bases associated with c; and 
c; are non-negative (there may be more than one set of boundary 
vectors), the larger value identifies the basis which discriminates the 
best among the subjects under consideration. А rule for finding the 
best data basis can now be formulated: Among the finite number of 
data bases for a given matrix, S, identify a basis for which с has the 
highest value. 

Yor a given S matrix, there are па data vectors. Therefore, the 
number of bases to be evaluated is ngl/r! (па — г)!. It does not 
seem to be necessarily true that for a given matrix, 5, there will be 
but one largest value of c. If there are, for a given ease, two such 
largest values, there are two арса” data bases of r vectors each and r 
of the 2r vectors are, from the definition of a basis, linearly inde- 
pendent. If the vectors in the two bases are dependent, one by one, 
substitution of vectors from one set to another will not change ¢ and 
the two bases are in fact identical. If the vectors of the two bases 
are not collinear, the vectors in one basis are combinations of two or 
more of the vectors in the other basis. Substitution of vectors from 
one set to another can then only decrease c, for all combinations of 
vectors have been used to get the largest value of c. Therefore, 
manipulation of the data cannot improve matters and cannot, there- 
fore, resolve the indeterminacy. However, whether in fact, the 
indeterminacy exists for a given 5 matrix сап be determined by find- 
ing all values of c. If there is but one basis corresponding to the 
higher value of с the most nearly orthogonal data basis has been 
identified. 
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THE MOST NEARLY INVARIANT SET OF 
COORDINATES 


When a basis has been chosen it is hoped that the application of the 
classification system to additional samples of subjects will result in 
some kind of invariance for the original set of coordinates even though 
additional sub-classes may have been attached to the classification 
system. When the classification system has been applied to a new 
sample of subjects it may well be that the coordinates of the data 
vectors on the principal axes will be markedly different for the two 
samples because these axes are extremely sensitive to changes in the 
cosines of angles between vectors. Yet a kind of invariance may 
exist even though the cosines of the angles between the data vectors 
change. For example, those boundary vectors which are most nearly 
orthogonal, and which form a basis, are just those vectors on which 
some vectors, and at least r — 1 on each, will have zero coordinates. 
Also at least r vectors will have but one non-zero coordinate. Now 
those vectors having one non-zero coordinate define the basis vectors 
and these vectors reflect the most distinctly classified behaviors, that 
is; the behaviors most different from each other. Therefore, it 29 
just those behaviors which would be required to change the most in 
order to render the matrices of the coordinates of the most orthogonal 
bases totally dissimilar, If, for example, the data structure of the 
first S matrix were orthogonal with rank r and the structure of the 
second were oblique with rank s > 7, it could still be that a data sub- 
basis of r vectors in the second sample could yield a sub-matrix of 
coordinates of order ng Х r similar to the nq X т matrix of coordinates 
in the first sample even though cosines between data vectors were 
different in the two samples. 

Similarity between two ng X r matrices with rank p JU and Fy 
say, can be assessed by the following means. Consider the equations 
which transform one matrix into another: 

FX =F, (3) 
where X isr X r. Then pre-multiplying by F’ and then by (ғу 
өзе X = (PF) p'p, 

From (4) it can be seen that if F and Е; are identical, then X is ар 
identity matrix. If they differ by constants, X is a diagonal matrix- 
Further increase in dissimilarity is indicated by the appearance of 
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non-zero off-diagonal entries in X. Thus if X is an identity matrix or 
a diagonal matrix, it can still be said that a degree of similarity ob- 
tains between F and F;. Similarity between F and F; is what Thur- 
stone (Multiple Factor Analysis, 1947, pp. 364-67) has called config- 
urational invariance in factor matrices. A somewhat rough but still 
serviceable indicator of configurational invariance is given by the rank 
order correspondence of corresponding columns in F and Рі. Config- 
urational invariance in “factor pattern” is given by similarity between 
F and Р, in which values of unity have been substituted for non-zero 
values in F and F, and in which zero values have been given to zero en- 
tries regarded as being non-significant. Fora non-negative matrix of 
coordinates, then, it is just those matrices which are generated by the 
most nearly orthogonal bases which are regarded as most likely to be 
invariant from sample to sample in the sense of likelihood of exhibiting 
Similarity. If the goal of the investigator is to determine the most 
nearly invariant matrix of coordinates, then the most invariant data 
basis should be that which is “most nearly orthogonal.” Such a data 
basis will hereafter be called a simplest basis. If the simplest basis is 
oblique and it is desired to use an orthogonal basis, an orthogonal 
basis best fitting the simplest basis should be found. Best fitting 
orthogonal bases are discussed in Chapter 4. 


ANALYTICS OF THE SIMPLEST BASIS 


The concept of the simplest basis, or equivalently the simplest matrix 
of coordinates, implics stability of data structure from sample to 
Sample. It therefore also implies increasing stability of relationship 
With variates or criteria external to the classification system as one 
goes from the least invariant to the most invariant basis. The most 
invariant basis is expected to have the highest internal productivity 
Over a range of samples; it is also expected to change the least from 
Sample to sample. Given an S matrix with rank r and an orthogonal 
data structure, an orthogonal data basis is unique under the restric- 
tion of non-negative coordinates. Гога specific S matrix with rank r 
and Possessing an orthogonal data structure, there will be s > & > 

- > 5, data vectors collinear with each basis vector, the number of 
Vectors in each group being 8. For the sub-group of s; vectors all 
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non-zero coordinates will be identical. An orthogonal rotation of two 
or more of the basis vectors will result in a new matrix of coordinates 
such that there will be additional non-zero entries in the new matrix 
of coordinates. Length is invariant under orthogonal rotations 50 
the new non-zero coordinates will be greater in absolute value than 
their counterparts in the original matrix of coordinates. As a result, 
reduced column variation in the new matrix will be one effect of the 
rotation. 

Assume now that the vectors of S have been expressed in terms 
of the r principal axes of (1/088 a matrix of joint proportions, the 
data structure being oblique. In the space defined by the data 
matrix S, the unit vector on the first principal axis is the vector most 
similar to the data vectors: on the second, to those vectors in ther = 1 
space orthogonal to the first principal axis and so on. In this sense, 
therefore, they are central vectors and the first. principal axis may be 
regarded as the “central” vector for the data vectors. Hach non-zero 
data vector has a non-zero inner product with the first principal axis 
so for oblique data structures only the zero vector is orthogonal to the 
axis whereas for an orthogonal data structure non-zero data vectors 
will be orthogonal to the axis. Each data vector is, therefore, in the 
plane of two or more principal axes except when the data vectors ne 
collinear with the first principal axis. The sum of squares of a 
cosines of each data vector with the principal axes sum to unity gne 
each squared cosine gives the proportion of the vector accounted for 
by the principal axis in question. " 

From the fact that the squared cosines of the ith vector of S with 
the principal axes sum to unity it follows that the vector for which the 
sum of the squared cosines on ther — 1 principal axes, excluding the 
first, is the greatest, is that vector which is least similar to the remain- 
ing data vectors. This vector will have the lowest squared cosine 
with the first principal axis and will be called the pivot vector. а 
there are squared cosines on the first principal axis with values Jess 
than 1/7, these vectors will have at least one cosine on some other 
principal axis which is higher than that on the first. The pivot vector 
will be the vector in this group with the lowest cosine with the first 
principal axis and the vectors in this group with low intercosines ате 
likely to be vectors of the most nearly orthogonal basis. Those 
vectors with squared cosines between 1/r and 0.5 may have a cosine 
on one of the remaining principal axes equal to or greater than their 
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cosines with the first principal axis. Thus those vectors with cosines 
of .707 or less on the first principal axis comprise a group of vectors 
which may have cosines on the later principal axes which are equal to 
or greater than their cosine with the first. Within this group is a 
sub-group which must have one cosine on one of the r — 1 last princi- 
pal axes greater than the cosine on the first principal axis. This group 
of vectors contains at least one boundary vector and is the group 
likely to contain all or most of them. Ву starting with the vector 
with lowest cosine on the first principal axis and values in the range 
from zero to 1/V/7, in the range 1 /\/y to .707, and in the range from 
707 to .866, groups of vectors significantly and differentially related 
to the r — 1 last principal axes may be obtained. 

The row vectors in the па X г matrix of principal axes must 
have non-negative cosines with the first principal axis and both posi- 
tive and negative non-zero cosines with the 7 — 1 last principal axes. 
Consider first only the positive cosines on the last principal axis and 
assume that there is a cosine which is the highest in its column and 
higher than other cosines, positive or negative, in Из том. Then the 
data vector corresponding to this cosine is one of those least similar 
to the first principal axes and therefore to the other data vectors. 
Next assume another such cosine on each of the principal axes exclud- 
ing the first. The result is r — 1 cosines, each corresponding to a 
data vector, each maximally related to orthogonal basis vectors, and 
each being accounted for less than 50% by the first principal axis. 
Another such set can be obtained, providing it exists in the data, by 
Considering the highest negative cosines on the r — 1 last principal 
axes. The net result is 2(r — 1) data vectors, 7 of which are lin- 
about as different as possible from 


early independent, which are 
To this group of vectors 


the ng — 2, + 2 remaining data vectors. 
may be added, if it is not already а member, the pivot vector, the 
Vector with the lowest cosine with the first principal axis. We believe 
s contain the vectors of 


that this group of 2r — 1 vectors will alway 
However, we have 


Ше simplest basis or bases as the case may be. 
found no way to prove that this belief is in faet true. 

Since proof that the group of 2r — 1 vectors discussed above actu- 
Ally always contain the simplest data basis is lacking, one cannot be 
Sure that the basis finally picked is the simplest basis unless it is also 

nown that it corresponds to the highest value of c. This can be 
computed routinely but at considerable cost in computation. Тһе 
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number of values of с or of the sums of intercosines of vectors in 
different bases to be computed may be reduced by noting that if the 
group of 2r — 1 vectors does not contain all of the vectors in the 
simplest basis, it must contain most of them. This is guaranteed by 
the way in which selection insures that the 2r — 1 vectors are collec- 
tively quite dissimilar to the other data vectors. One can proceed 
by taking those r vectors, including the pivot vector, of the 2r — 1 
vectors which have the lowest sum of intercosines. This is the most 
nearly orthogonal basis for the 2r — 1 vectors. After determining 
the cosines of all data vectors with the vectors of the basis, a group 
of r clusters of vectors may be obtained by locating for each basis 
vector those vectors having higher cosines with the basis vector than 
with the first principal axis. Also they must have higher cosines 
with the basis vector than the basis vector has with the first principal 
axis. Taking values of с for various combinations of the pivot vector 
with the members of the clusters on the remaining basis vectors will 
identify the simplest basis, 

Usually 2(» — 1) vectors cannot be found which meet the con- 
ditions specified above. However, setting the same conditions for 
the r — 1 principal axes excluding the first usually gives 2r — 1 
vectors from which the simplest basis may be selected, and at least 
r will be obtained. Since taking linear combinations of the vectors 
in clusters has the advantage of using more than one vector to define 
a basis, it is tempting to do so. However, such a basis will then 
generate a matrix of coordinates with negative entries. A basis 
derived from combinations of cluster vectors may, however, be advan- 
tageous if the investigator is going to use a matrix of coordinates on 
orthogonal basis vectors. Orthogonalizing on the basis derived from 
linear combinations of cluster vectors very similar to the vectors of 
the simplest basis will most likely produce a satisfactory orthogonal 
basis. However, there is Still a risk of small negative coordinates, 
which would necessitate graphical orthogonal rotations. 

When there are two or more bases corresponding to the highest 
value of с, the differentiating vectors in the different bases will usually 
have high inter-cosines, Therefore, taking linear combinations, вау 
centroids, of the vectors differentiating the bases will give a satis- 
factory final solution, particularly if the final oblique basis is orthogon- 
alized. In other words the value of c derived from the basis arrived 
at by taking linear combinations of the vectors differentiating the 
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bases associated with the highest value of с will generally be very close 
to the highest value. In practice one сап expect the association of 
the highest value of c with more than one basis to be unusual; more 
usually the simplest basis will be unique in the sense that it will be 
the only basis associated with the highest value of c. Another way 
in which to choose one from among different simplest bases is to find 
the sum of the variances of the coordinates on each basis vector. 
The basis corresponding to the highest sum of variances should then 
be selected as the final basis. If more than one simplest basis corre- 
Sponds to the highest sum of variances, linear combinations of the 
Vectors differentiating the bases may be used to obtain the final basis. 
In the simplest data basis, r of the nq vectors have been selected 
for special emphasis. This is not in accord with the thinking of 
those who would rely on more than т vectors for establishing г basis 
Vectors, However, approaches using more than 7 vectors to establish 
3 basis have not been considered because of the restriction to non- 
Regative entries in the matrix of coordinates which is the only type 
of matrix corresponding directly to the matrix of represented behav- 
1015. The cluster approach and the principal axis approach may be 
Modified so as to result in a final non-negative matrix of coordinates 
ving direct behavioral meaning. Orthogonalizing on bases derived 
rom clusters of vectors is the way in which such bases may be 
‘ndled. The use of principal axis methods is described in the 
Succeeding section. А 
ће development to this point may be stated as follows: The sim- 
plest basis is that data basis expected to exhibit invariance or stability 
under Additional sampling of responses of subjects, and under adding 
= categories to the classification system. Such КА еі 
in А Cted for the same reasons that factorial invariance i кале? > 
eim analysis. Lengthening the column vectors A t aad + 
Во S by additional sampling would require that the a te 
in 18 so change the data structure that the vectors тасының were 
s the Structure become the longer vectors and that those № és is 
Р longer vectors become the shorter vectors. о Ева 
Sir Іп the S matrix are determined by the раритет ы. 
dona] Subjeets would necessarily have to be quite different in 
We Ihr fn e agn ipei sta reme 
in the mid ЗЕ, change to: dap Seo seas ЛД happen 
Jects while they were observed. That this co 
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jects 
cannot be denied, but that it would happen differentially for subject 
from two samples under similar conditions seems unlikely. -— 
The simplest data basis is that most nearly orthogonal, a is Mié 
the coordinates are non-negative, and for which the value of г sta 
highest. When a data basis is associated with a unit нае di 
basis is orthogonal and unique; no other basis meeting the = ee 
of non-negative coordinates can then be obtained. When t 1€ Vitali 
est value of c is less than unity, a kind of uniqueness can still = m 
in the sense that just one basis is associated with the highest uu 
In this case the basis is oblique and is the most nearly orthoge jue 
basis. When more than one basis is associated with the highest 3 p* 
of c, taking linear combinations of the vectors еки 
bases provides a satisfactory resolution of the indeterminacy af " in 
by two or more simplest bases. The indeterminacy is eee 
the sense that a value of c for the final basis close to that of the en 
est value will be obtained. The value of c for the final data = 
used may be viewed as a measure of internal productivity. It p 
be rare that more than one basis will be associated with a value of €? 
in practice the indeterminacy of basis will 
A substitute for c entailing less 
index of internal productivity 


seldom be a problem. де 
in the way of computation 15 


1-и (5) 
r(r — 1) 


where wu is the sum of the elements of the ail 
unit basis vectors and r is the number of vectors in the basis. Ў га 
internal productivity is maximal, the index of internal productivit? 
attains a value of Unity; when interna] productivity is nil, à we 
of zero. 
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WEIGHTED DATA VECTORS AND BASES 


In considering the simplest data b 
of inner products on simplest bas 
because they are maximally sepa: 
vectors which are orthogonal, v. 
basis vector, and other vectors with high cosines on the basis vector 
will have zero to low cosines w 


-— sanct 
asis 1t can be seen that the varian e 
es will be near to а maximum valt 
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im for vectors with high cosines with a basis vector. In the case in 
wer а аге collinear with the vectors of an orthogonal basis, 

venga of the cosines of the data vectors with the basis will be a 
maximum in the sense that the sum of the variances of the cosines on 
cach basis vector will be a maximum. This is so because any rota- 
tion of basis vectors will result in additional non-zero cosines of data 
and in the reduction of the higher cosines 
Consequently, it is to be 
rs on 


vectors with basis vectors 
with a consequent reduction in variance. 
expected that the sum of the variance of cosines of data vecto 
simplest basis vectors will be near а maximum value and will be 
greater than similar variances for the great majority of data bases. 


It could be that for some non-orthogonal basis which is not à 
ances of absolute values of parallel 


, asis than for the simplest basis. 
This may be so wh asis are members of 
clusters of vectors with very high inter-cosines. But the difference 
ШІ the sums of variances would not be great and the bases would in 
fact give similar matrices of coordir imilar in fact as to gener- 


nates, SO 51 
ate highly similar or identical hypotheses and interpretations. 
The main point in the preced 


ling discussion is that the simplest 
data basis is quite similar to that data basis with the greatest sum of 
variances of cosines on basis vectors. Hence the simplest basis is a 
data basis for which the data structure is, ог nearly is, expressed in 


the simplest and most parsimonious way for a basis of r vectors. 

When all data vectors are collinear with the vectors of an ortho- 
normal basis, this simplest basis is the basis of unit principal axes.* 
This can be illustrated for the two у noting that it 
is desired to find the maximum value of 


simplest basis the sum of the vari 


projeetions might be greater for the b: 
en the vectors of the simplest b 


-dimensional case b; 


Ta >= (А: сов 0)? + 2 (һҙт/2 — cos 6)? 
1 


s; mis the number of data vectors 
s; n — mis the number of data 
asis vectors; 0 is the angle between 
first principal axis; 7/2 — 0 is the 
angle between the other basis vector and the first principal axis and 
h: and №; are the lengths of the ith and the jth data vectors. The first 
derivative of T with respect to 0 is csin 0 cos 0 where cis non-zero. The 


where n is the number of data vector: 
collinear with one of the basis vector: 
vectors collinear with the other b: 


one of the basis vectors and the 


*'This was pointed out by Professor Charles Wrigley- 
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derivative can only be zero when sin @ = О or cos à = 0. = M 
happen only when 6 = 0 or 7/2. Solving with @ at zero and a 
gives the two extremum values. Land 
Іп this ideal case the data structure is completely ООН ce of 
any orthogonal rotation of axes will reduce the sum of the ae 
the coordinates on cach basis vector, Also, introduction of new 
vectors not collinear with the ba : T. 
principal axes located nearer to the “center” of the configuratio 
vectors in the sense that 
centroid axes than before. 


; "F у set of 
isis vectors will result in a new 


they will more closely approximate à 
In 1 his case of an oblique data jubere 
the principal axes will represent the data vectors in a highly pipe ete 
manner rather than in the simple and parsimonious manner 0 aih 
simplest basis. This will be reflected in the lesser sum of the sin in 
powers of the coordinates of the data vectors relative to the case 
which, for the same number of dat 
in the oblique case, the а 
Structure, 


ч ngths as 
а vectors with the same length ае 
4 hogond 
ata vectors have a completely orthog 


Since it is those vectors m 


, а vectors 
aking the configuration of data vect 
oblique which locate the 


bi id axes, differ- 
Principal axes near the centroid axes, d the 
ential weights may be applied to the data vectors according to 
obliqueness of the data structure 


ary vectors and those ve 
vectors than to vectors 


80 as to give more weight to bound 
ctors having high cosines with the boundary 

With low cosines with the boundary NEQUE 
The vectors contributing most to the obliqueness of the data ee" 
will in general be the longer vectors whereas boundary vectors an 
vectors with high cosines with the boundary vectors will tend to be 
the shorter vectors. It seems reasonable, therefore, to weight data 
vectors so as to relatively shorten the “central” vectors and relatively 
lengthen the “outer” or boundary vectors. 


Product of each data vector with the ргіпсі- 
Teases the үлгі 


ance of the coordinates along 

rows of the Principal axis matri oes using the squared cosine. 
. 1 
Įuared is equal to or less thal 


топ Variation and increases it 
The cosines of data vec 
pal axes are the Proportions of the data ассо 


when all vectors have been given equal w 


ors on the princi- 
unted for by the axes 
eight and P:/p is the propor- 
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tion of the data accounted for by the kth principal axis. Therefore, 
among available constants, weighting directly by the cosines and in- 
verscly as the proportion of the data accounted for by the principal 
axes maximally increases row and column variation in the matrix of 
coordinates of data vectors on principal axes. 

. The effect of row weighting and column weighting is to greatly 
Increase variation among (һе clements of the matrix. The increase 
in variation is differential with respect to single vectors. Those 
Vectors with high cosines with the last principal axis will be relatively 
Increased in length and those with high cosines with the first principal 
axis will be relatively decreased in length. Also the similarities and 
differences between pairs of vectors as measured by cosines will be 
accentuated differentially, the accentuation being most marked for 
those vectors lengthened the most. And in general those vectors 
lengthened the most will be the boundary vectors containing among 
them the vectors of the simplest basis or bases for the boundary 
Vectors and will have the highest cosines with the r — 1 last princi- 


pal axes. 


Let 
G = PP (6) 


the coordinates on principal axes of 
ons, (1/t)SS’. Then G represents an 
decision that enough basis vectors 


Where P is the ng X r matrix of 
Ro, the matrix of joint proporti 


Approximation to Ro based on the Bol 
have been selected. Also G has rank 7 whereas the rank of Ко is in 


general пу — n + 1. The r principal axes of G are the same as 
the first » principal axes of Ro and P is a sub-matrix of the principal 
axes of Ro. Let D be a diagonal matrix with elements of normalizing 
constants for the rows of P, and K a diagonal matrix with elements 
P/Pr where p and рь are the sum of the latent roots and the kth latent 
root, respectively. Then matrix DPK is a matrix of the proportions 
of unit data vectors giving the proportion of the data vector as repre- 
sented in P accounted for by each principal axis weighted inversely 
as the proportion of the data vectors, (not in normalized form) 
accounted for by the principal axes. DPK is then such that the 
Vectors with the highest cosines on the r — 1 last principal axes will 
become the longer vectors and those with the highest cosines on the 
first principal axis the shorter vectors. 


Taking the principal axes of 
DPKKP'D =L (7) 
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gives the weighted data vectors expressed in terms of the nae 
their own principal axes, denoted by M. The nq Xr matrix, | a 
will reflect to a certain degree a reversal of the roles of the me 
axes of Р, Kor example the last Principal axis in M will ciem 
or nearly all small non-zero coordinates and the vectors with ially 
weights on the last two basis vectors of DPK will have pan 
significant roles in determining the first principal axis of L. in 
the variation in Coordinates wil] change greatly although the lengt : 
of the vectors of DPK and of L will be the same. The highest = 
lute values of coordinates will be numbers such as, say, 10.5, ignem 
the zero and near-zero entries of DP will remain at zero and кекете 
values іп Г. Thus it may be said that the boundary vectors yox 
will be accentuated the most т L. Yet all vectors participate in t 
formation of the Principal axes of L. TT 

It was conjectured earlier on the basis of factor analytic thinking 


р s 2 able 
and experience that the simplest data basis would be the most sta 
or most nearly invariant data basis 


rause the simplest data basis is similar to 
ader the restriction of non-negativity and о 
àn orthogona] bases. Тһе simple structure solu- 
ably more stable or invariant under 
addition of new tests to the bat tery when 
rincipal axis or centroid bases. 
The simplest data basis and the simple structure bases usually 
ata vectors. On the 
Sof L weight most heavily those vectors 
of the simplest data basis an tors having high cosines with 
them while at the same ti i "rive m all vectors, For this 
reason it seems likely that ix, M, of coordinates on 
the principal axes of L is the Most stable or Most nearly invariant 
matrix of coordinates obtainable from the data vectors while at the 
same time Possessing the advantage of being unique. The unique- 
ness permits, therefore, a decision betw 
unweighted data basis corresponds + 
seems also that the matri 
tion to the simple structure 
the fact that it must have f: 
factorial invari 


o the highest value of c. It 


actorial invari 


ance 
ance and by reason of the 


fa 
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Сын solutions, the principal axes of L provide an objec- 
"eh ee = selecting one (or neither) of the solutions. Ап example 
ded ysis of data based on matrices L and M is given in Appen- 
Ші... сее squares of the coordinates of the row vectors of М 
hae ta aoe asa measure of invariance. The measure of invari- 
s ne same as that obtained by taking the sum of the squared 
ben stipes of row vectors of DPK. A weighted, and better, measure 
мој е ueni: from M by multiplying each coordinate of the kth 
the ote axis by the proportion of the kth latent root to the sum of 
5 a ent roots. In this way each row of elements of M is weighted 

У the invariance of the principal axes of L as well as by the invari- 
ance of the data vectors, those vectors having the highest weightings 
being the vectors with highest coordinates on the first principal axis. 


Although the matrix M renders the selection of vectors having 
ceptible to visual selection, comput- 


the highest invariance values sus 
be employed. One such 


ing routines for clustering the vectors may 
routine follows: 

1. Select 2(r — 1) vectors from M by taking from each of the 
first r — 1 columns those row vectors with the highest positive and 


H 
Negative coordinates. Add to this group, if it is not already a mem- 


ber, the vector with the highest invariance value. 

2. Compute cosines between members of the group of 2r — 1 
Vectors. Since there will finally be 7 vectors of a basis 27 — 1-7 
Vectors will have relatively high cosines. Eliminate the 2r — 1 — r 
Vectors by comparing cosines. Those vectors having high positive 
cosines with another vector are the candidates for elimination. ОЁ 
two vectors having a high cosine, the vector with the lowest invariance 
value is to be eliminated. 

3. With r linearly independent pivot vectors selected, compute 


cosines of all data vectors with the r pivot vectors. If any vector has 
a pivot vector and a higher invariance value than 


а high cosine with 3 
the pivot vector, it should replace the pivot vector. Also if any 
all of the r pivot vectors and a higher invari- 


Vector has low cosines оп | 
ance vector than any one of them, it should replace the pivot vector 


with the lowest invariance value. 
The final result is 2 set of r linearly independent pivot vectors 


which may be ordered in terms of their invariance values. А cluster 
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can be sensibly defined as a group of vectors having higher ра 
cosines with the pivot vector than the pivot vector has with the = 
ра! axis on which the pivot vector has its highest cosine. A " em 
во defined may contain vectors with relatively low invariance và ue 
The cluster may be further delimited by requiring that the a 
invariance value for the members of cach cluster order the cluster 

the same way as the invariance 


value of the r pivot vectors. 
The row vectors of 


А ‘vot vectors oF 
M may be weighted by using pivot ice for 
: ; ata basis 
by the use of clusters of vectors so as to obtain a final data ba 
the weighted data vectors. 


:ves a basis the 
Use of the pivot vectors gives a basis 
individu. 


al members of which have the highest possible е. 
values consistent with the dimensionality of the data ршн the 
use of clusters results in basis vectors which are combinations of t 
vectors in each cluster, and thus h 
stability. The vectors in each с 
1. 


ave some advantage in statistical 

luster may be combined as follows: 

Weight the row vectors in M corresponding to the ith cluster 

by the inner product of each row ve 

2. Weight each row ve 
ance value. 


s actor. 
ctor with the pivot ve ctor 

у : J] invari- 
ctor of the ith cluster by its weighted inva 
3. Sum all weighted vectors in the ith cluster to obtain a centroid. 
4. Normalize the centroid. 


The procedure described weights ү; У 
the pivot vector and by their invariance values with greatest weight 
being given to the invariance values. 

The final basis of Weighted data vectors should be used to deter- 


mine a basis for the unweighted data vectors. When the г pivot 
vectors are used as a final basis for 
vectors in the matrix of principal 
may be normalized and used as a basis for the unweighted data 
vectors. When this is done the final basis of unweighted data vectors 
will often be a simplest data basis, When there is more than one 
simplest data basis, the final basis so derived is the preferred basis. 

When linear combinations of row Vectors of M derived from 
clusters of vectors have been employed in obtaining the final basis 
of weighted vectors, the orthogonal transfor: 
the weighted vectors of М 
terms of the matrix of princi 
basis vectors) has the form о 


КЕСЕК instr t0 
ectors both by their similarity t 


M, the corresponding unweighted 
axes, P, of unweighted data vectors 


Pal axes, P. 
f Equation ( 
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с td ram > The transformation matrix, X, is the rX r 
YORE and m € unit patina axes of M being expressed as 
ет не н. ia à ход bul axes of P expressed as columns. 

на in ue E he ? vectors of the final weighted basis are ex- 
са ne i iei of the unit principal axes of M, denote the rx? 
dt 3E tee А is of unit basis vectors with the principal axes 
екені 5 Then MX is an rX r matrix expressing the r basis 
k: "m e t ле weighted unit data vectors of M as r rows in the frame 
бата о axes of P. And P(MX)' gives the ng x r matrix of 
Pean ка ts of the unweighted data vectors on the basis of weighted 
dines. Y ei The mat rix of cosines of the vectors of the basis is MM’ 

XX’ is an identity matrix. 
Š Es “props the vectors of the basis given by the rows of М are not 
un est data basis. Thus the ng x r matrix of coordinates of the 

ighted data vectors on the vectors of the basis of weighted data 
Veetors, given by PX'M'OIM))^, will contain negative elements 
and rotating to an orthogonal basis approximating the oblique basis 


will be required. Even though the matrix of coordinates is not in 
ferred oblique basis for the unweighted 


rom those vectors having the high- 
titutes a good basis from 
1 basis (о be used for hy- 


опот . "— 
: neral non-negative, it is а pre 
ata vectors because it is derived fr 
It therefore cons 


te i "am 
St invariance values. 
na 


Which to obtain an approximating orthogo: 
Pothesis formation and inference. 

| The discussion of weighted data vectors and bases may be summar- 

ized as follows: Weighting data vectors and bases by normalizing 

data vectors and weighting by the inverse of the proportion of latent 

he latent roots minimizes the role of the first 

ference frame. As such it mini- 


roots to the sum of t 
ning the re 
havior classified. The 


principal axis in determi 

mizes that which is common to all of the be 

weighting process increases the cosines of unweighted boundary vec- 

tors already having relatively high cosines and decreases the cosines 

of unweighted boundary vectors already having low cosines. The 
e first principal axis are, on the other 


vectors with high cosines with th 


hand, considerably shortened. 
f the matrix of inner products of the weighted 


The principal axes 0 
data vectors emphasize what is common to the unweighted data 
vectors dissimilar to the first principal axis in a manner consistent 
f the data space. They also yield measures 


with the dimensionality © 
of the stability or invariance of both single vectors and of the princi- 
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" n 
palaxes. Тһе matrix, M, of coordinates of weighted data E 
the principal axes may be regarded as the most nearly inv ge 
matrix of coordinates for reasons quite analogous to the mel “~ 
involved in the concept of factorial invariance in factor — a 
From M the r linearly independent most nearly invariant — nee 
combinations of ғ groups of vectors may be used as r unit мает 
and expressed as coordinates on the principal axes of the unw т a 
data vectors, When the » most nearly invariant vectors are use n 
basis for the unweighted data vectors, the basis will either be "eed 
plest data basis or one very similar to it. When clusters of weig rs, 
data vectors are used to form a basis for the unweighted —— : 
the basis will not be à simplest basis but again will be similar "€ 
simplest basis. In this case, it is necessary to rotate to an orthogon? 
basis approximating the oblique basis, 


Р x sill not 
In general bases formed from unweighted data vectors wil 


have maximal stability but will provide bases satisfactorily appro 
mating the invariance of the principal axes of weighted data vecto 
A distinctive advantage of M is that tors 
formation. A distinctive advantage of data bases is that the vecto 


Е г абд м k ~ roups 
of the basis are similar one by one to relatively widely spaced ic^ 
of vectors; thus the Process of hypothesis formation is rendered les 
difficult. 


-inato in. its 
all vectors participate in 


TYPES AND TRAITS 


The preceding discussion, 
S matrix, that is, 
responses, may be considered to be a discussion of traits. Following 
the lead of Thurstone (1947), a trait may be defined as a characteristic 
lation to what it does 
ons: On the other hand, types refer to 
ects specified in terms of their traits. A 
be equivalent to an analysis of the data 
alysis of the n 
ill be the same as 
es on the i 

he second 


case it will be ¿X 7, ¢ being 
It should be 


remembered that both traits 
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са ses ср from classified behavior deserve only as much 
ов не Е the classification system warrants. The warrant of 
shia s bu e from the results of the data analysis which serves 
йын cem or the definition of types and traits. ‘When relation- 
litus sui e or traits to variates external to the classification system 

‘ound, an additional basis for confidence in the classifica- 


tion system has been established. 


A 
NALYSIS OF THE DATA MATRIX S 


ata matrix, 5, 


Sine : 
€ only unit and zero entries are permitted in the d 
is the longest 


oe vector, designated as Y, with unit entries only s ong 
ны е vector. Whether or not it explicitly appears іп 8, it is a 
Sine снг of the а rows corresponding to any response т. 
нені has 1 unit entries, the length of Y is УТ and the normalizing 
ай "dd is 1/Vt. Normalizing all row vectors of S to Y and post- 

ultiplying S by its transpose, gives 
a/v) SS’ (/У) = Ro (8) 


With general element P;; and diagonal element Ри. The square sym- 
roportions. It should be 


Metric matrix Ао is thus a matrix of joint p 
Doted that the diagonal entries of Ко and the entries of (1/0) SY’ are 
identical. 
It ean be verified that 
sy’ = DZ' (9) 


and 
Ss' 2' = nDZ' (10) 


Where D is the diagonal matrix formed from the diagonal entries only 
and Z’ is a 1 X ng matrix with unit entries only. Pre-multiplying 
(9) by D~ gives 
pss’ 2' = nZ (11) 
which shows Z' to be à characteristic vector and п as a characteristic 
or latent root of 07758". | 
Pre-multiplying (11) by 8” gives 


(S'D38)S'Z' - nS'Z' (12) 

but S'Z' = nY' (13) 
, 15 23 = (4 

(80-15) тҮ (14) 


Therefore 


36 


THE ANALYSIS OF EMPIRICAL CLASSIFICATION SYSTEMS 
showing n and Y' to be a characteristic root and a m 
vector, respectively, of the symmetric matrix (60-15). in an 
more, n must be the largest such root for the square matrices 0 ү" 
and (14) because under the one-and-zero coding system employ: “= 
obtain S, Y and Z’ аге the longest vectors in the row and co 


4 inserting 
spaces implied by 5. Pre-multiplying (11) by D"? and inse 
DD? between S and Z' gives 


15) 
(Р-р) Deg! = прид" ; 


The matrix ром D-1, representing cosines of angles B 
the row vectors of S, is symmetric. Therefore, D'2Z' is a oor 
пе vector of D-!288' Di and п is the largest latent root. Furth 
more, (1/4/ Ре’ is the first principal axis for 


Ирма = (1/0, + +... + ty) on 00 


and by definition the inner product of a princip 
itself is the characteristic root associ 
vector. Post-multiplying the matrix 
principal axis, considered as a column 
gives an ng Х nq square matrix whic! 
mally. The scalar element of 0-15, 
product moment correlation for со 
could be accomplished by 


al axis vector with 
ated with the characterist 
of coordinates on the ae 
matrix, by its transpose then 

h reduces D-/2$,8’D-/? maxi- 
570-18 is the formula for the 
mmon elements. Analysis + 
taking the first principal axis of D-!288D à 

ав the first basis vector; the second principal axis as the second basis 
vector, ete. However, the analysis will, instead, be based upon fte 
We note, therefore that the entries of п/у”, being entries of the 
first centroid of Rg, will usually be very similar to the corresponding 


entries of the first principal axis because obliqueness of data structure 
will usually obtain. 


The ng X nq matrix of unit rank {{(1/t)SY']} 
{1 /QSY')} or equivalently, Q/0DZ'ZD(0), will then reduce № 
efiectively. "This unit rank matrix has entries Р.Р; the expression 


for the expectation of P;;, the general element of Ro, when row vectors 
Тапа jare independent. The remainder matrix 


Ri = № – (1/00 0 (17) 

has rank r — 1. 
The greater an entry in (1/4) 
corresponding to the entry. He 
more it resembles Y and the les 


SY’, the longer the row vector of S 
nce the longer a row vector in S, the 
5 the angular Separation between the 
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in the “center” of the con- 
e fact that the inner 
ner products 


DM Clearly then Y is, in a sense, 
rahe " Јане vectors of S. This fact and th 
ill tora: erow vectors of S with Y are maximum in 
The ны obtaining а final form for the analysis of S. Е 
- Das minder matri discussed above will contain both positive 
ative entries. The negative entries correspond to the fact 


tha о ег 1 m ^ 
~ each subject must “avoid” q — 1 sub-classes of the classification 
em in order to “select” one. The positive entries show & joint 

dence in S due to Y has 


«m to “select.” Since the linear depen 
by the е ip the first remainder matrix Ві, = vit pe Чисте 
stand to % ст я 7 на аре ~ == 
Ru Whenwer x һеп another basis les : y be Ages 
column normalized to the diagonal value, the entries treated as & 
unit r matrix and post-multiplied by its transpose, gives à second 
а вес ank ng X па matrix, which, when subtracted from Ry results in 
se G remainder matrix, Аз. Alternatively, à linear combination, 
asa as the centroid vector of the first remainder matrix, may be used 
inf second basis vector. In fact the whole collection of methods used 
pr analysis may be applied to obtain a set of 7 basis vectors. 
n some an ng Xr matrix equivalent to S by employing successive 
mainder matrices is a routine matter depending only upon à choice 
of a single basis vector from each of the Ri Ra +++ R, remainder 
Matrices. However, the use of remainder matrices is cumbersome, 


the ; ; rans- 
he ng X r matrix of coordinates оп basis vec n be € 


f і Іп 

ormed to the ng X r matrix corresponding to t 

general, obtaining successive remainder matrice 
nnecessary labor. 


For data matricós, S, which ara па X t, 83 the number of basis 
inates of the 74 row vectors of S upon the 

Consider for example, the 
nted in R and, in general, 
Then, as basis vectors 
erage decrease; 


basis vectors must in £e 
long vector, Y. It has uni 
cannot be a basis vector for 2 
are ejections о: 
the taken from Ro P ыр of basis vectors increases. The scale 
can be held constant by normalizing the vectors defined upon a given 
basis vector by the 0007 inates of the row vectors of S to the vector 
defined upon the pasis vector by coordinates of the long vector Y. 
Doing this for e2¢ pasis vector gives à standard scale for interpreting 
projections no matter how many vectors in the basis. 
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2 : " tations 
Tn order to simplify the ensuing discussion, the follow ing no 


ате used: (18) 
Ro = (1/0) S8 9 
1 
Дер =P, — Q/P)DZ'ZD = 
2 
Now Ry = ARA! 


қ МЕР” inates) of 
where A is an ng X т matrix of parallel Projections (coordinat 
Tow vectors of S upon th 


5 Ky» 
S * 7 simplest basis vectors and Ra is ther X 
$ of the simplest basis veetors. Also, 
Ro = VR,V’ а 
og- 
where У is the 74 X rmatrix defined by taking unique vectors orth A 
onal to all of the basis vectors represented in R, but one. Matrix 


у hich 
пег products of those particular basis vectors one 
Thurstone calls reference factors. The relationship of the vec 
expressed in R, and R, сап be expressed as follows 


Ra = DRD - 
where D is the matri izi 
self-inner Products 
expressed in R 
shown below. 


Let 


i т 
appear in the diagonals of Rx. That each Pao 
but one of the vectors of Ra i5 


Ra = рр (28) 
where Fisa triangular matrix obtained by use of the Gramm-Schmidt 
orthogonalization Process, | 


п P the unit length basis vectors ex- 
pressed in R, in terms of their mutua] inne; Products are now ex- 
pressed as unit, length row уе 


Cors. Since p is r X лапа has rank 7, 
it has an inverse and 
РЕ- ү (24) 
also 
FFD- p (25) 


where D is the diagona normalizing constants for the 
column vectors of F-1, This is the Same diagonal matrix as that of 
(22), also, 


Ra~ = Fp 


and thus from (22) and (26) R, is the norma]iz 
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from (25 
5) a 
"ed E ean - column vectors of F~ are just those vectors 
s orthogonal to all vectors of the sim) i 

Mise facie s e simple basis but one. 

Where C j iac 
A қ 
Thus m E ап nq X г matrix of coordinates on àn arbitrary ba: 
defining ethod of factoring a square symmetric matrix suffices to 
шақалақ an ng X r matrix equivalent to № which can be carried to 
"adn matrix, 4, whose entries represent parallel projections on 
ors of a simple basis. 


(27) 


sis. 


THE 
Q-TRANSFORMATION 


ner product of the 


Once д 

long En A, and Ra have been determined, the in 
ctor, Y, with itself is given, when the rank is two, by 
(28) 


yi + ya + 2уугфа 
e cosine of the angle 


w 

аа a are the coordinates of Y and Ф is the 

higher re the two basis vectors Equation (28) is extendable to the 

Nami imensional case. When rearranged as à symmetric matrix, 
ion (28) shows in the diagonals the proportion of subjects in the 


el Е 
ass of subjects defined by each basis vector. The off-diagonal 
common to both classes. 


entries o; 
= give the proportion of subjects 
wae define the diagonal matrix @ with the entries being the coordi- 
es of the vector Y on the basis vectors; that is, yı апа у» Then 
Ro = (4Q- (QRQ) (QA 8) (29) 
obtained by arranging the terms 


Ink А i 
i Equation (20) QRaQ is the matrix 
о Equation (28) as a symmetric matrix, the elements summing to 
Unity, The entries in AQ- are the vectors, defined on the basis by 


the row vectors of S normalized to the projections of Y. Equation 


(29) introduces à standard scale of reference. If it is desired to use 


с. 
proximating the simplest basis, the Q-transfor- 


àn orthogonal basis ар ipi 
mation should be applied to the matrix of inner products of the orthog- 
Onal basis vectors. Then Equation (29) becomes 

Ro= FQ^(QQ)Q^F' (30) 


where QQ is à diagonal matrix because the basis is orthogonal and Р is 
the ng X т matrix of coordinates on the orthogonal basis. 


40 


5 
SYSTEM 
THE ANALYSIS оғ EMPIRICAL CLASSIFICATION 


у inner 
A Particularly interesting ease arises when the ах d 
Products of the vectors of the simplest basis is an ped ere 
Then Equation (30) is an expression of Equation (29) and the s ӨШ 
of FQ- may be interpreted as dependent probabilities whore the 
entries of QQ may be thought of as independent А 
probabilities referring to the Probability of the class itself. шекеге 
such an interpretation seems not to be particularly useful for equ 
tic investigation, In naturalistic investigation the e cii are: 
embodied in the classification system. The questions of interest 6 


1. How many classes 


А rectors 
exist; that is, what is the number of v 
in the basis? 


2. What is the structure of the data? 


3. Is the internal 


sc for 
a base 

Productivity sufficient to serve as 89 
further hypothe 


Sis formation? 
These questions are answered j 


У roduc- 
n terms of the index of internal pro 
tivity, in terms of the rar 


‘dinates of 
ak differential, and in terms of слива ере 
data vectors, The problem is not to test hypotheses concerning 


А А from the 
existence and number of а Specified set of classes but, rather, from 
Observation of behavior 


~ ow 
sis of data strueture to find h 


"ucture. 
of classes arc implied by the data structu 
In short, the aim is to Produce more 


The reduction of the matrix of Joint Proportions, Ry to an 
тд X т matrix of Coordinates on orthogonal] basis vectors, with 7 
taken as less than the actual rank Ro, тау result in small negative 


inner products between the row Vectors of the matrix. Negative 
inner products May be eliminate, 


d by the Use of “communalities” 
analogous to the communalities of factor analysis, The паво com: 
munalities implies substitution of the о; 


rigina] set or Vectors by a new 
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Set. rexer и 
usa MM En as such a substitution may appear to be, the 
vectors and iik ities has in fact the effect of altering the lengths of 
uf роб ке lr angular separations while leaving the configuration 
hypothesis f ној unaltered for purposes of inference and of 
ormation. 
Consider 


Ry = Ro - (1/0)8у”Ү8” 
t cen- 


Where R, i 
1 is the first residual matrix after extraction of the firs 
axis of Во is usually similar to the first 


troid axis 
d axis. The first principal 
ропзез апа 4 alter- 


centro} " 
ue = = Given an S matrix based on n res i 
of the : 2 4Х4 principal sub-matrices in Rp corresponding to each 
Рае SPONSES are diagonal with non-zero elements of Pi. In № 
РА — i sori 4 X q principal sub-matrices have diagonal entries 
i) with off-diagonal entries —P;P;. The sum of each row in 


t 

тұн q principal sub-matrices of Ја is zero. 

а estimate for the diagonal values of Ri may be 

onal 56; 95 in factor analysis, by substituting the highest off-diag- 

( entry in the rows of Ја for the diagonal entry excluding the 

Mun 1) entries in the q X 9 principal sub-matrices corresponding to 
response. Тһе off-diagonal entries of the (0х 9 principal sub- 


Matrices are negative. Therefore it is desirable to increase these 
The adjustment of all elements in à given 
onding to à given response may be 


multiplication of the principal sub- 


with elements 


entries toward zero. 
oti 4 sub-matrix of В, corresp 
ме omplished by pre- and post- 

atrix by the diagonal matrix W 


_ МРа= P Pall 
wi = pP — РО 
w of the qX 
alue of the highest off-diag 
of the ith row and РТ — pj is the diagonal element. ‘Such a pre- 
and post-multiplication of each q X 4 principal sub-matrix will result 
in a new 9 Х 4 sub-matrix with |Ра- PIA in the diagonals and 
will move the negative off-diagonal entries toward zero. The new 
matrix may be designated by Ri and succeeding matrices R5, etc. 
may be obtained in the same manner. There will be (r — 1) of the Ri 

nding ng X 1 matrices of coordinates on 


— 1 correspo 
d from each. Denote the па X (r — 1) 


q principal sub-matrix, 


Where $ denotes the ith ro 
onal element 


| Pa — Р.Р is the absolute V: 


matrices, and 7 


basis vectors determine! 
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ing the 
matrix of coordinates by F, post-multiplying by F’ and adding 
^d X nq matrix of rank one, (1/е)5у уд’ gives 

Ri = рр’ + /?SY'yYs' — 
Where the elements not in the 2 x 4 principal FM eere of Bo 
Sponding to the responses are approximations to the element Неге 
Ti; and R, differ principally in the elements of the 4Х4 и ренин 
matrices Corresponding to the responses, In №, these unde ба 
are diagonal, in Ro they are not necessarily so. Furthermore, 


th 
H H ^»etors of bo 
Veetor with elements Piis a linear combination of the vecto 


id of 
а . ntroid 0 
Ro and mj. However, this vector is not necessarily the ce 


tors 
. H - H e үес 
іх Ro, being the matrix of inner products of м f behav- 
X S, which is a representation de non- 

г. However, as is shown by RE corre- 
zero inner products + Х 4 principal sub-matrices or. ir to 

& matrix referring payee’ 
. H 5 e 
behavior, Instead R? тау be taken ав referring to response 
o May g 

tion or respon 


cclu- 
: excl 
аге not necessarily mutually 
Ponses Classified are. 


ee Sere: | Е be un- 
alysis, Mitial communality estimates may 


z able 
or analysis, iteration to st 


OW vectors of 5 were 
; that is, each vector is Weighted by the constant, 1/ > 
So normalized to Y, the Matrix of Table 2, Ro, is 2 symmetric matrix 
of joint Proportions, 


with the second principal axis. 
and 13. Тһе inner products of the 
and 13, taken as unit length basi: 


ese are th 
Temainin 


Б Vectors With vectors 1 
S vectors 


» ате given in Table 4. 


TABLE 2% 


Dara M 
А MaTnIX оғ Іххы 

INNER Pnopvers, Ro. (Fictitious РАТА ОР TABLE 1 
19 13 14 15 16 


) 


1 9 : 
3 + 5 6 7 8 9 10 11 


1 56 207% 
9 5 2- a 4S 08 32 24 0S 48 04 52 16 40 32 24 
3 б 28 32 12 24 20 08 36 16 28 28 16 
4 48 00 40 Os 28 20 08 40 08 40 16 3 
5 52 40 12 28 24 08 44 12 40 28 24 
6 80 00 44 36 16 64 16 64 30 50 
7 20 12 08 00 20 04 16 14 06 
8 56 00 08 48 16 40 24 32 
9 44 08 36 04 40 20 24 
10 16 00 04 12 0S 08 
п 84 16 68 36 48 
12 20 00 16 04 
13 80 28 52 
14 44 00 
15 56 
16 

Р 

Decimal Points are omitted. 

TABLE 4* 


PropDUCTS OF 


А, INNER 
кстоня OF TABLE 3 ох 
F THE OBLIQUE 


MATRIX 
Row V. 
тне VECTORS 0! 


TABLE 3* 
СТМРЬЕЗТ BASIS 


Ма 

ATRIX P ор 

^ X P or COORDINATES ON 
RINCIPAL Axes OF Ro 


њом ә 
O00-005&oti 
- 


10 85 

11 18 

12 82 t 

13 41 5 5 

7 4 m 3 45 69 

15 68 A á " 75 

5 E = Y 84 75 
* Decimal points are omitted. 


= “іа. 
Decimal points 279 omitte 
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TABLE 5» 


Marnrx, Ra, оғ INNER Pnopvcrs 
OF VECTORS оғ THE OBLIQUE 
SIMPLEST Basts R, 


a 


* Decimal Points are omitted, 


Table 5 shows the matrix 
inner Products, p 
Vectors of the sim 
vectors being 

The index of in 


TABLE 6» 


Матыах, Р, оғ Coorpinarns ON 


ORTHOGONAL Basts ; 
THE OBLIQUE Бім 


09 

2 09 56 
3 53 10 
4 23 55 
5 69 42 
6 05 23 
7 43 34 
8 32 31 
9 11 11 
10 66 54 
11 02 25 
12 73 40 
13 08 57 
14 67 08 
15 87 St 
16 39 0) 
Y 76 65 


* Decimal points are omitted. 
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* 
TABLE 7 


Matrix РО! 


P - itted. 
* Decima] points are omittec 


"able 9: 
computed from the data of we 
is .73, indicating considerable + jous 
nal Productivity for the fietit 
classification system. x F of 

Table 6 gives the matrix, [an 
Coordinates on the vectors : the 
orthogonal basis approximating 


ly- 
7 gives the results of re 
ing the Q-transformation. Ta m 
S the proportion of subjects 


TABLE g* 
Manny QQ or Crass р, 
58 


'ROPORTIONS 


^ Decima] Points аге omitte, 


d. 
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each cla NS 
It поене were not used in these computations. 
tation ofthe и such as those of Table 7 and 8 that the interpre- 
жү : "i ses of behavior should be made. The interpretation 
inthe Шыны ormulation and extension of the hypotheses embodied 
ation system. 


be represented in the form 
n terms of internal produc- 
on of original hypotheses 


THE: dires 

ғ ханнан қас of subjects may 

tivity, D 5, which, when analyzed i 

He Fei dien 2, ап extension or reformulati 1! 

of behavior е m the classification system used. The original classes 

тони as represented in the data matrix, 8, may be replaced by 
q — п + 1 behavior classes, the vectorial representations of 


whi 
E veasa basis for the data vectors of 5. 
А funda given S matrix with rank 7 there will be many bases for 8. 
basis — problem is that of selecting the most appropriate 
appropri purposes of inference and hypothesis formation. The most 
à e iate data basis is the simplest basis consisting of r best bound- 
tion: ectors. The boundary vectors are those vectors generating 
"s negative matrices of coordinates and the best boundary veotors 
Among those at most 2(r — 1) data vectors having the highest 
cosines with the principal axes starting with the second and the data 
Vector with the lowest cosine with the first principal axis. The віш- 
Plest basis is the most appropriate basis because it is the data basis for 
Which the matrix of coordinates of the data vectors is expected to 
Vary the least under repeated sampling or under changes in the 


Classification system. 

_ An alternative, and perhaps better, basis than the simplest basis 

is the basis of principal axes of the weighted data vectors. The data 

Vectors of S are weighted by expressing the data vectors in terms of 

Coordinates on principal axes and weighting the coordinates on the 
ortions p/p where рь is the 


kth principal axis inversely 25 the prop i 
kth latent root and р 18 the sum of the latent roots. This weighting 


relatively lengthens the boundary vectors and relatively shortens 
those vectors largely accounted for by the principal axes with the 
larger latent roots- Since the vectors of the basis are determined by 
all of the weighted vectors, but largely by the boundary vectors, it is 
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expected that the 
basis will vary les 
simplest basis. 

Instead of usin 
of the weighted da 
vectors. 


veighted 
matrix of coordinates on the vectors of tire Е а. the 
8 from sample to sample than the coordinates 


ineipal axes 
5 the matrix of coordinates on the gai pes 
ta vectors, one may use a basis of r weighte 


If it is desired to use a basis for the unweighted data vectors repre 
sented in P (p. 39), the » vectors of Р » pivot 
T pivot vectors of М may be selected as basis vectors. Or the 7 jx 1 
vectors or the vectors of the basis derived from clusters of ни m Я 
тау be expressed With their coordinates on the principal og 
matrix P and used as the basis for the unweighted data vectors. 


У na 
application of the Q-transformation Will then put all coordinates 0 
Standard scale, It is the 


should be used as the ђе 
inference, When а basis 


's as the 
with the same numbers à 


CHAPTER THREE 


VARIATIONS ON THE S MATRIX 
AND WEIGHTS FOR SUBJECTS 


FOR 
MING $ FROM DATA MATRICES NOT IN 


5 

TANDARD FORM 

be in standard form 
on system, S repre- 
ations of behavior 


Tie 
Xploratory analysis the matrix S is said to 


me aan 8: unit of response and а classificati 
could Ъ escription of behavior. Other represent: й 
sented e used. For example an “answer pattern" may be repre- 
"— as a vector with one unit entry, all remaining entries being 
cw Then the answer patterns all have inner products of zero or 
is y when the vectors have been normalized. Such a procedure 
m be logically impeccable but in fact involves arbitrary operations 

n the vectors represented by the column vectors of S, for the infor- 
Mation represented in each column of S must be known before the 


M f 2 
€ctors with but one non-zero entry representing each answer pattern 
or some array of data logically equiva- 


e be constructed. Thus $ 

ent to S is logically prior to such a matrix representing answer pat- 
terns. What is of importance here is that representations of behavior 
which seem to be equivalent and to contain exactly the same informa- 
tion as 5 do not in fact do 50- 

We turn now to 2 more subtle problem of equivalence, that in 
Which different aspects of each of à set of responses are classified. 1% 
is not always realized that classifying aspects of a response may intro- 

+ be taken into account. А schematic 


duce dependence hich mus 


W. 
data matrix, in W s of each response have been classified, is 
, is 


hich aspect 


48 VARIATIONS ON THE S MATRIX AND WEIGHTS FOR SUBJECTS 


shown in Table 9. Each unit of behavior has been classified on the 
basis of two aspects called X and Z. In X, each response is classified 
into one of two sub-classes denoted by A and B. In Z, cach unit is 
further classified into two categories, D and E. At first glance, the 


TABLE 9 


Fictrrious Data MATRIX 


Main Aspects Sub-A Spects 


Responses of of Subjects 
n-8 Responses Responses 1234567891011 12 
Ж А 101001010001 
| В 010110101110 

1 

“ D 011000011 101 
Е 100111100010 
" A 001110101 0 1 0 
" 7 В 110001010101 
2 D 000110101010 
Е 111001010101 
x A 10100111100 0 
" | В 010110000111 
z D 110010000111 
Е 001101111000 


matrix of Table 9 seems to be ап appropriate matrix for 
shows just what each subject did and did not do and thus seems to be 
equivalent to S, which contains just one unit entry for each response. 


However, when this matrix is compared with its apparently equiva- 


'd dependence becomes apparent. 
f the matrix of Table 9. The 


analysis for it 


Now consider Table 10. 1% contains а а 
the same data that is recorded in Table 9 b 
exclusive classes from the combinations of th 
Thus, a unit classified as A may also be classi 


ata matrix derived from 
Y forming new mutually 
e sub-classes of X and 2. 
ified D or p. "Therefore, 
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TABLE 10 


Fictitious Dara тх Sraxpanp Form, S 


Subjects 


Responses Classes 
5678 


n=3 т-д 


5) 


1011 І: 


ы 


г 
© 


AD 
AE 
B,D 
B,E 


A,D 
АЕ 
B,D 
BE 


A,D 
АЕ 
B,D 
B,E 


-oco|s 
ноо е 
мнооо 
оноо | о 
ноо о 


[v] 


"ooo|loo-c 
=есојо- о 
-oooc|loo-oc 
нооо| ооо 
мәсоо|соосоьке 


0 
0 
1 
0 
0 
0 
0 
1 
0 
0 
1 
0 


Gsm e Bee 
o-ocloco- 


See Sil Scie: sieves 
он о о 


ооо н 
оноо 
“зе е | бе а и 
оноо|ооо 
оон о 
eobmre|joeoou 
oono 


Similarly a unit classified 


the с]; 
he classes (A, D) or (А, E) are possible. 
asses (В, D) or (В, Е) 


as 
S В may also be classified as D or E, so the cl 


ü ы 4 
Те also possible. Then we have, for each response, four mutually 
n, the number of responses, is 3, the 


Now, however, the vector Ж 
nse rather than twice for 


exclusiy , 
е clusive sub-classes. Since, 
ate i Е | 
ata matr ix, as before, contains ng rows. 


Occurs : 
curs but three times, once for each respo 
each response. Hence the maximal possible rank for exactly the same 


behaviors as those in Table 9 is 10. Thus there is more dependence 
m Table 9 than in Table 10. It follows that the vector space associ- 
ated with the first matrix is a sub-space of that associated with the 
Second, for although both are based on the same classification of the 
Same responses, the rank of the second is greater than that of the first. 
Elementary row operations will transform the rows of the first matrix 
into those of the second. But the rows of the second cannot be trans- 
formed into those of the first. It seems, therefore, that the form of 


the second matrix is more f undamental than that of the first. 


Every sct of classifications may be brought to the standard form 
be introduced by variations 


and problems of dependence which may à 
Írom the standard form can thus be avoided. The basic procedure is 
Simple. 5 сап be formed by requiring that for each response with q 
alternatives of response, there be one and only one unit entry, the 
remainder being zero entries. 
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MATRICES DERIVED FROM S BY HYPOTHESIS 


Once the data is in the standard form of S the investigator might 
have hypotheses which imply a matrix which embodies the hypotheses 
rather than the data itself. For example, the investigator might 
have the notion that thirds of interviews constitute a basic sequence, 
more basic than the Sequence of responses. When this is the case, 
S may be changed in accordance with the hypothesis. 

Let g be the number of collections of responses the investigator 
desires to keep in Sequence; the within-collection responses not to be 
kept in sequence, Then define the mat rix S, such that for each of the 
collections of responses there are ng/g = п. rows. S, from which S. 
is derived, has ng rows and S, has gre rows with gn, = ng. One such 
matrix, S., derived from an S matrix with n = 6, 4 = 3, is shown 
schematically in Table 11, 


In S. of Table 11 the Sequence of responses has been dichotomized. 
Therefore, the unit entri ame meaning as those in 


the matrix 5 from whi The first two unit entries 
in the first row indica’ 


the first half of the sequence 
jects. Within the first half, 
at any point in the Sequence. The ( 
column of S, indicate that thre 


of responses for each of the first two sub- 
ach response may have occurred 
hree unit entries in the second 


Second subject. Note that 
The first unit entry codes the 


Se, complete similarity of be- 
T products of 6 betw 


first a-type response, ete, Thus, for 
havior as indicated by inne 


Matrix S. is of order те X t, where 1 is 
That the maximal possible rank for a given | J. is, as in the analysis 
of S, gn. — (gn./q) +1 is shown directly by the fact that the row 
vector Y with unit entries only /9 times in cach ef iege 
Тһе maximal possible rank of S. in Table ll wi 

Given the information tha 
system are mutually exclusive, we know at once that S, was derived 
from a matrix S for which ng — n Те For there аге ng = 
Tows and q mutually exclusive classes, 
rived from 8 by elementary row or colu; 


gne 
However, S. cannot be de- 


Тап operations, Therefore, 
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TABLE 11 


Dara MATRIX, Se 


mE Subjects E 
ЖЕЛІ 


1 2. 
a=? q-3 пе = 9 
1 1 1 
а 2 0 1 
3 0 1 
ý 1 во 
b 2 0 0 
3 0 0 
1 1 0 
с 2 0 0 
3 0 0 
4 1 1 
а 5 1 0 
6 1 0 
92 4 0 1 
5 5 0 0 
6 0 0 
4 0 1 
с 5 0 0 
6 0 0 


of discourse than 5, occasioned 


S. corresponds to а different universe 
al sequence of responses on the 


by neglecting consideration of the actu: 
basis of hypothesis or of prior knowledge. Sc is particularly useful 
columns and may be analyzed in the 


for assessing similarity between 

same manner as S’. Caution should be exercised in analyzing the 

rows of Sa for the meaning of the row vectors can, in specific cases, be 

quiteambiguous. One хау to keep in mind the potential loss of infor- 
es of the type S. is to realize 


mation resulting from the use of matric 
that a given S, matrix can be derived from different S matrices and 


that different S, matrices can be derived from the same S matrix. 
On the other hand since each S, matrix derived from a given S matrix 
represents an hypothesis OT set of hypotheses about the behavior 
represented in 5, the analysis of S. matrices can be very illuminating. 


An actual S, matrix is analyzed in Chapter VI. 
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WEIGHTS FoR SUBJECTS 


Other types of problems than those considered here may be рт 
by bringing data into the standard form of the S matrix. For 8 
ample, suppose there аге 4 objects to which ; subjects respond on н 
occasions in such a Way that it can be ascertained that one of them is 


s : assigning а 
preferred over the remaining 4 — 1 on occasion п i Then assigning 
unit score to the Preferred object 


j ; ; : ; гете 
objects generates an 74 X t matrix, S, showing which objects we 


S can then be analyzed for the 
bserved first preferences. 

sider all of the preferences of t sub- 
$ of discrimination and for п occas- 


which contains P unit elements and p(q — 1) 


zero elements. ix over all subjects and occasions 
will be ng X pt. Then different q X ptsu 


bmatrices may be summed 
to get a summary 


Fri is 
The summary matrix i 
sires to use a least squares 

approach to sealing. 


Inleast squares Scaling one is interested 
for categories or of "Scores" for subjects w 


sense of least squares, Тһе Scales are derived from the internal con- 


S or from matrices derived from S, not from 
structural relations, The least squares scales 


8 01 successive latent vectors adjusted to zero mean 
and unit variance for the data of 5, Since a set of “optimal” or least 
Squares scales for a Particular set of dat 
latent vectors, they emphasize what is е 
о x. 


КЕТЕ 
in finding a set of “weights 
hich best discriminate in the 


а is derived from successive 
ommon to the sub-categories 
ЕД dimensions. 

In exploratory analysis the caso ік quite different from that of least 
Squares scaling; although one might start with exactly the same data 
in both cases. In exploratory analysis the simplest basis (or an orthog- 
onal basis best approximating the simplest basis) is derived from 
behavior vectors as different as Possible for the configuration of behav- 
lor vectors at hand. In other words the best discriminating redons 
are defined as those which best reflect differences in actual behavior; 
thus “discrimination” here has an empirical or naturalistic tinge. 
Those vectors of 8 defining a simplest basis are those with the small- 
est cosines with each other. The Problem of weighting the actual 
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DM represented in S then becomes one of weighting й in terms 
i nonse: m to underlying vectors, the underlying vectors being pro- 

ни > e vectors of S into an r-dimensional sub-space. 
T ҺЕ lng the actual behavior of the subjects can be accomplished 
нени en е enote by D; the diagonal matrix formed from the jth 
ар е he final (лд x r) matrix of coordinates on basis vectors 
абы. лв НОО has been applied. The 1 row should be 
Masc . Then find D;S and transform the entries in D;S in such a 
сы ланды mean becomes zero and their sum of squares is unity. 
mi this matrix by 5; Then find ZS; where Z has unit entries 
лде 4 X па. This is the array of scores for each subject for the jth 
а ector. The more similar the scores the more similar the sub- 
ae are in terms of the underlying response dispositions. The entries 

5; are the weights assigned to each category for each response and 
the score of each subject is the mean of his subcategory weights. Each 
Subject gets r scores for there are 7 matrices of the type бя one for 
each basis vector. 

It should be remembered that the scor 
the sums of n of the weights assigned to the а: by the operations de- 


scribed and it should be emphasized that the scores for the individuals 
are not "factor" scores. They represent the weighting of behavior 


ying relations. 
sh on the basis of hypothesis 
assign weights to the rows of S which are 
the basis vectors. These weights 
1 matrix and post-multiplied into 


es for the individuals are 


based on ascertained under! 

In some cases the investigator may wi 
ог of prior knowledge to 
different from those assigned by 


should then be written as a diagona à 
the (nq + 1) X г matrix of coordinates on unit basis vectors after 


which the Q transformation should be applied. The weight given to 
ater than any other. Then the 


the Y row should be equal to or gre des 
S matrix may be treated as before. In our opinion the application of 
ter of internal analysis 


arbitrary or hypothetical weights is not a mat 
but rather of external analysis. The relationship with criteria or the 


differential relationship of hypothetical and derived weights with 
criteria js what counts. Above all, the analysis of the arbitrary or 
hypothetical weights is not a matter of purely statistical analysis. 
Finally, we believe that hypotheses should be introduced into the 
data only in the sense that the sequence of responses is condensed or 
changed as when jn interviews, for example, thirds or halves of inter- 
views are taken 25 the unit of analysis rather than the response. 


CHAPTER FOUR 


ORTHOGONAL BASES 


APPROXIMATING AN OBLIQUE 
SIMPLEST BASIS 


Іп an exploratory analysis the matrix of inner products of the vectors 


of the simplest basis, Re, is the matrix which defines the internal pro- 
ductivity of the classification System. As such it is fundamental. 
However, in general Ra will not be an identity matrix, reflecting leu 
fact that most empirical categories will not be mutually exclusive. 
Oblique bases confront us with essential and mathematical difficulties, 
the essential difficulty being that overlapping categories present com- 
plex problems of inference. Since Ra will usually be respectably close 


hogonal basis approximating the oblique 
a Will have associated with it an nq X т matrix 
есіогв of S with the orthogonal basis vectors 


configuration of vectors is such th ating position is 
obvious. This will often be the case. In considering computational 
methods we observe, following Green (1 


; 952), that the problem is to 
find the ng X у matrix, G, where 


where G is the matrix of coordinates a. 
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ог Я 
мы. = € A is an orthogonal transformation matrix with 
thers isto bs ength. In general no such A exists. Therefore, 
айта os 4 = bâsis best fitting A in any absolute sense and a 
the selection А ning G is needed; the criterion may be objective but 
One іу de е» given criterion is ns an objective p 
is deg о criterion for selecting a best fitting orthogonal basis 
atitea В nee criterion. | Unfortunately, several least-squares 
еее des be used which give different “best-fitting” orthogonal 
B. is “Жебе take а least-squares fit to R,, the reference basis to 
ma least east-aquares fit to Ја can be used. Furthermore, one can 
With Р -squares fits to the nq X 7 matrices of coordinates associated 
ring me and R,. There seems to be no compelling reason for prefer- 
thin: 4 ca of the least-squares solutions. One sensible solution is 
défis ue depends upon Ra for it 18 the basis which most directly 
it 5 мна productivity. Green has considered this case among 
to hi в. What follows depends upon his development. By reference 
$ paper (1952) it can be shown that 
Ат- (A'GG' А)“ АЕ (84) 
as follows: ¥ = X3? if and 
For any n X n square sym- 
an find an orthogonal 


госеѕ5. 


where the fractional exponent is defined 
T" ut уз = X; by Хо is meant у“. 
Metrix matrix of rank n such as (А” GG’ A) one с 
Matrix P such that 

A'GG'A = PDP’ (35) 


are the latent vector: 
re the correspon! 


where the columns of P a AAA, ma e 
elements in the diagonal matrix D a ding latent roots. 
Now from (35) we have 

(a'ga aye = РРР’ - 
for 
ppwp'PDP' - PDP' 
because P is an orthogonal matrix for which P'P is an 1 matrix. 
Therefore, 
(аваг) = РОВ (87) 
rmation matrix, A, тау be obtained by deter- 
and latent roots of A'/GG'A. These vectors 
d as the matrices P and D. Finally D-¥? 
ls of the positive square roots of D. 


The orthogonal transfo 
mining the latent vectors 
and roots are then arrange 


is simply the matrix of reciproca 
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= ing the 
A is then found by using equations (34) and (37). Applying 
method to №, we find 


А-(ЕРУ-чар” e 
where F is obtained from R, 
This is equivalent to m 
between the simple a the 
Since cosines ате а measure of similarity, maximizing the sum of 
cosines maximizes similarity. у 
Since the kinds of least-squares fits are many, the p ione 
lengthy, and the choice Personal, we outline below а method for fin 4 
i ancously a best fit to Ra is 
f differences of cosines. T he 
an extension of Gibson's (1952). Given 
П orthogonal basis whose vectors PU 
angles with the corresponding vectors of Ra an | 
s with minimally different angles is not difficult: 
angles between the 


by the diagonal method of ае 

. n е 
aximizing the sum of the cosines of the pes 
xes and the corresponding orthogonal axes. 


method may be considered as 

ау the problem is to find a 
minimally different 
Res Finding a basi 
simply bisect the 


» for if the inner products of Ra are €— 
» or tend to be, negative. We will now 


: Consider 
Ra = pp (39) 
where F 15 obtained by the diagonal method of factoring with unity 
in the diagonals and 
FR = р (40) 
FFA, = p, (41) 
where L is the matrix of normalizing constants for the columns of F~- 
Also consider 
F' pay (42) 
and 
M" + Р-р 4. Рм = р, (43) 
where M isa diagonal m 


atrix of normalizj 
im is that the 

Ra Ry, Ra.. od, (44) 
goes to Г, the identity matrix, 


of F' + FL. The cla; 
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ы. сс of squares of each row of F is unity with the first 
This is a е eing unity and the others equal to or less than unity. 
Sinot ath iom of the diagonal method. If (40) is to hold, the 
the licae! ? à each column of p must be greater than unity when 
unit re N ues of F are not unity. Since the rows of F represent 
(40) repr vectors and the columns of F~ do not, the J matrix of 
presents the perpendicular projections of non-normalized vec- 

Thus the J matrix of (40) shows that 
F- have zero inner products with all of 
This implies that the angular separa- 


ti 

ван хез represented by the columns of F~ are greater than 
the езе — by the rows of and thus of Ra More precisely, 
P fa the cosines of the angles between the vectors of Ra 18 
Diss an the sum of the cosines between the column vectors of 7. 
жүзе + shows the effects on (40) of normalizing the columns 
аш на shows that the normalizing constants are the cosines of the 
Fanati between the simple vectors and the reference vectors of FL. 
(asd E (41) justifies the use of the expression, reference vector, 
enini Thurstone’s sense) because it shows that the normalized 
simple vectors of F— are each perpendicular to all but one of the 

vectors. 

si MN: (42) is an expression for vector sums. Consider the 
» ns: by the parallelogram law, the vectors of (42) are in the planes 
panned by the non-orthogonal (associated) simple and reference 
vectors. Tt follows that the sum of the cosines between the column 
vectors of F’ + F-1L is less than the sum of the cosines in R, and 


mor қ : Я 
ore than the sum of the cosines 1n R,, the matrix of inner products 
Furthermore, р. will be more orthogonal 


in the same way that HR; is 


tors 
a upon normalized vectors. 
е ver ne 
d» у ectors of the columns of 
unit basis vectors save one. 


9f the reference vectors. 
than R, because Rs is obtained from Rı 
obtained from Ra- 
" Although the sum of the cosines ir 
he. «+. В, becomes successively less, 
ether the sums of the cosines of f, 
The question сап be answered in the negative because the diagonal 
values of the inverses of the F; have minimum values of unity since 
the diagonal values of the F; have maximum values of unity. Equa- 
- (40) guarantees that when the diagonals of F~ are unity then so 
are the diagonals of F. Since the sum of squares of each row of F is 


n the off-diagonal entries of А}, 
there remains the question of 
say, can be less than zero. 
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unity, F then becomes an J matrix and F = F’ = R = Ко. 2. 
fore, the sum of the cosines of each of the R; is equal to or grea 
than zero. m— 
Another question arises: Even though the sum of the en s 
the matrices Ra, Ri, . . ., В, becomes less and less, does it happen tha 


P А Л For, 
the sequence goes to 17 The answer is in the affirmative. 
consider the sequence 


T 5 
Р' + Е, Р 4 р, р (45) 


The ith diagonal value of F, for e 
unity; of F- is 1/z, of P' + F- i 
value of Fy + F3 is z, + 1/1. 
the sequence 


п 
xample т, is equal to or less ne 
sx + 1/т. Also the ith dingo 
However, ду > х. Hence we hav 


46) 
t+ Ve>m+1/y>...> 2441/2, ( 

The maximum value of т; 
unity. This minimum is, 
the minimum value of 2; 


; ; Cai xis 
is unity and the minimum value of 1/ 3 

за unity ar 
of course, reached when z; is unity 


e 
+ l/z;is 2. It can be shown that the 8 
quence (46) converges ава cons 


equence (45) converges. Now paya 
possibly for the first diagonal value of Fi + Fi values of 2 in jus 
diagonal ean occur only when the off-diagonal entries of the aie 
are zero. Hence r — 1 of the vectors of (Pi + ҒӘ) are orthogona 
when the diagonal values are 2 and (һе value 2 can be attained, for the 
Sequence converges. Now if the off-diagonal column entries of 
(F; + F3) are zero, 


80 are the off-diagonal row entries, Therefore, 
if the diagonal values of (Е: + FS 


E ?) are 2, the column vectors of 
(F; Е») must ђе orthogonal. 


£ determined by the previous one. 
Bisection certainly minimizes the differences between the angles 
created by the bisection. The final orthogonal basis, therefore, is 
obtained by a series of minimizing Processes. The presumption is 
that what is minimized is the sum of differences of the cosines of the 
final orthogonal basis on the one hand and the vectors of the simple 
basis and the associated vectors of th 


(Әне reference basis on the otber. 
A detailed proof that (44) goes to I is given in Appendix B. 
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The о е ТРИЕ : 
fidus process of orthogonalizing is shown in Table 12. Тһе steps 

omputational process follow: 

1. Сіз а 5 

Basis 21 Ra the matrix of inner products of vectors of the simple 
» find F by the diagonal method of factoring. 

2; -I Q; E ІН 
йе Compute ка, Since F is triangular with zero entries above 
sm ain diagonal, it is convenient to use the relation F'F'— = I for 
iis puting F—, 77-1 has as diagonal entries the reciprocals of the 
үлы ақың entries of F’. Тһе entries of F’~ below the main diagonal 
мнр 89 only the entries above the main diagonal are unknown. 
ae the inner products of the row vectors of F’ and the column 
Kinet of P" are zero, the unknown value іп F'— may always be 
‘les for with but one unknown if one starts by solving for the 
€ Down entries of 7771, working up from the main diagonal in each 

umn. Note that 77 = (F-1)’. 
" 3. Normalize the columns of F- 

he jth entry of L is 1/М1; where l; is the sum of squares of the jth 
column of pa, 

4. Take (F’ + F-L)M where M i 
и аге 1/Ұ/ту where ту is the sum 0 
, a 
OF eat FL) and premultiply by its transpose. 

M(Ra + LRGL + 2M = № 
roducts of the approximate 
ntly approximate orthogon- 


1 with the diagonal matrix L. 


s a diagonal matrix whose 
f squares of the jth column 
The result is 

(47) 


Equation (47) is the matrix of inner P 


orthogonal basis. If А, does not sufficie 
ality it may be treated as though it were Ва and the process repeated 


to get Rs. Л, will often be à satisfactory approximation to orthog- 
Onality. №, will nearly always be a satisfactory approximation to 


orthogonality. 


5. Find 
AF(F’ + КОМ = А (В.М + LM) 


= A(M + ВМ) (48) 


where A is the matrix of coordinates on the simplest basis and А is 
the matrix of inner products of the data vectors with the vectors of the 
simplest basis. This is the transformation carrying A or A to the 
best-fitting orthogonal basis when Ја is orthogonal or considered to be 
satisfactorily orthogonal. When ја is not satisfactorily orthogonal, 
R, must be found. Associated with Әзіз (Fi + FUL) My. If Re is 
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8 ; in 
satisfactory, then the transformation carrying A to the best fitting 
orthogonal basis is 


А(В.М + LM)(M, + RLM) (49) 


It should be noted that in (48) F is introduced to get the шешімі 
of A expressed іп the same arbitrary orthogonal frame as the vectors 
of Ra and В, which are in F + F’ respectively. . | 

When the off-diagonal entries of Ка are equal, then the off-diagon® 


entries of LIE will also be equal and opposite in sign to those of Ra 
When the R, is of rank 2, 


then Ra + ІВ, = diagonal (2, 2, .. ., 2) (60) 


Іп this case the values of 


Ка + LRGL + 2L = diagonal 


It is interesting to note ( ast-squares solutions 
are identical when the elements of Г, are identical. The solutions 
referred to are, respectively, those in which the best orthogonal 
approximations to reference ve 


ctors and to the vectors of the original 
oblique basis are sought. The values of 7, will always be identical 


when the rank is 2. Hence Green’s solution and our solution are 
identical when the rank is 2. Now Green's so] 


maximize the sum of the cosines be 
and reference axes and the corresponding orthogonal and simple axes 
when the values of L are equal. 

sum of the differences betwee: 


hat two of Green's le 


based on primary axes and reference axes will be identical. As the li 
vary, Green’s bases will “rotate away” 


from each other. In practice 
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TABLE 12 


ORTHOGONALIZATION COMPUTING METHOD 
ror Data оғ TABLE 4 


Ди + RLM) 


— (ТАЙ 
A, = 52 —14 І I 
A g – 14 52 1 .06 209 
ac, ON OBAT 2 09 56 
3 58 10 
r ІШ, = Re 4 23 55 
ИННА 5 49 42 
Е 100 —2 
2 u = — 1 еі в 058 28 
A 10 0 ne т 43 34 
и = ee 
__ Bet Ret th __ 10 66 5 
EN o 3.93 00 п 02 25 
EN a Және 
100 00 3.93 12 73 40 
Me. 13 08 257 
= 14 .67 .08 
M(Ra + В. + 20М 15 37 -64 
Lc 16 .39 200 
E. LT E i y 76 (65 
1.08 -.29 ____00___==== re 
e - 
(м + RAD 
a eee 1.03 — 
.96 —.14 1.03 
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therefore, our solution w 


ill generally closely resemble Green’s two 
solutions for, as he points 


: SP = а: rë 

out, his solutions will yield very similar н 
; aki 'en- 

sults in most cases. To put the matter another way, taking the с 


я | : > 5 s very similar 
troids of his solutions should yield a set of basis vectors very siml 
to those attained by our solutions. 


The computations usin 
in Table 12, for the dat; 
and 5. 


Matrix Ra of Table 12 is identical with the matrix of Table 5. was 
obtained from №, by the diagonal method of factoring and BGs 
obtained from the relation РР’ = 7. L is the diagonal matrix of 
normalizing constants for the rows of P'-' and for Rẹ. The enis 
of M are m; = 1/2 + 2l; where 1; is the diagonal entry of the ith 
row of L. The rest of the table is self-explanatory. 

It should be remembered that the ort hogonalization process 
illustrated applies to any set of oblique basis vectors. ‘Thus, one 
may estimate В, from Ro, orthogonalize and rotate graphically to Re 
if the estimated R, is not the actual R,. Orthogonalizing on the 
actual В. then gives the final approximating orthogonal basis. 


£ the orthogonalization method are 2. 
а of Table 1, Chapter И and from Tables 


CHAPTER FIVE 


HIGHER ORDER MATRICES 
AND VECTORS 


T " 
Pp M matrix S in standard form is 8 representation of 
—— Е id of the classification system applied to a set of 
status: ae they occur. ‘The rows and columns of S enjoy a unique 
vectors Баты) are the original, empirically given, vectors. Other 
"ewe mnot be constructed without possession of the information 
higher = S. Of particular interest are what might be called the 
тепсе order vectors. Ав the row vectors of S show the single occur- 
and generate the pairs of occurrences, 50 do vectors exist which 
уз nerate the triples, quadruples, etc., of occurrences. The vectors 
Showing the pairs and generating the triples of occurrences may be 
Constructed as follows: consider row vectors i and j of S, i correspond- 
mg to п; and j corresponding to 7) Form a new vector denoted by 
147, the elements of which are the products of correspondingly placed 
elements in row vectors 7 andj. The vector 6) is à Boolean “and” 
vector showing directly the responses common to vectors i and jin S. 
In matrix notation the different vectors of the form i&j may be written. 


844. S&iis an ng X į matrix representing the Boolean “апа” vec- 
to the ith row vector of S. There are ng matrices 


of the total of (nq)? row vectors in the ng 
The upper limit of the number of different 


tors corresponding 
of the form Sci but not all 
matrices will be different. 
such vectors is 


a "= 1) 


е vectors of 5. Thus when considered as a 


and nq of the vectors are th 
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matrix, the vectors of the form 7&j form an 


ng(n — 1) 

EXE эви 

matrix, nq of the vectors being the vectors of S. 

Some of the vectors of the fo 

upon the vectors of 

independent of the 
the vectors of the 


xt 


3 t 
rm 46) may be linearly dapes 
S and some may not be. Those vectors a 
vectors of S indicate that the simplest basis 


ng + EOD, 


matrix of vectors of the form 1) may not be the same as the simplest 
basis for the vectors of S. That the basis for the higher order vectors 
may be a basis in a higher dimensional space than the basis for um 
first order vectors is hot а serious problem. What. is serious is Е 
none of the simplest basis Vectors for the vectors of 5 may be vectors 
of the simplest basis for the vectors of the augmented matrix, the 
matrix containing 6 and the vectors of the form 247. This nds 
Suggests that the difference between the simplest basis for 5 and for 5 
augmented by higher order vectors be investigated. 

When row vectors 7 and j of S are identical then 

Sei = кеј 

апа S&is’ = 8418” 
When vectors 4 and j have high cosines, the pairs and triples of occur- 
rences associated with i and J will also be Similar. Even when i and J 
have relatively low inner Products, the cosine of i&j with 7 and with 7 
will be high relative to the cosine of 7 and j, for all unit entries in 267 
will be shared by i and j. Thus when the rank of S augmented by 
the different vectors i&j is greater than S not 50 augmented, the length 
of the images of the 1&j in the subspace defined by the vectors of S 
will be high in relation to the lengt 


h of the ?&j in the higher dimen- 
sional space. 


ples of occurrences by tak- 
ing S&i&jS’. The rank of each в 


ke ша! to or 
greater than the rank of S and there Will be as many different matrices 


ав there are different combinations % а ЕВЕ 
therefore that, for a given S matrix a У all of the different 
vectors from 561, 86467, S&i&j&k, Uo Ваа. . «пд at- 
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tached, th 

tive to los augmented matrix may have a large number of rows rela- 

конне іс 2 of rows in S. That the vectors of S form a sub- 

also clear fs уон defined by the vectors of the augmented matrix is 

T ü T > ў ере А 

іші, om he ith row vector of S is also 2 vector of the form i&i, 

Adopti А 

ея ы” ng the convention of calling 5 a matrix of order one, S&ia 

vector in ^ der 10 S&i&j а matrix of order three, ete., the ith row 

niet ттн tj will then be a vector of order three or а third order 
: 1e ith row vector of the second order matrix Sei will be а 


Second order у Missed 
order vector. The inner product of the jth row vector with 
е equal to or Jess than the self 


itself j А 
ге m i matrix of any order will b 
pro d У ха. дай x x ё si 
Suse Pros uct of those vectors in S contributing to its formations and, 
general it will be less. 
osines of the kth row vector 


of zn the second order vectors the cosines 0 © с 
lis ае и the three vectors of S entering into its formation will 
has is 4 1 han with other vectors, for the kth row vector of S&i&j 
ilas ge entries in those positions 11 which the three vectors of S 

ave unit entries. It follows that projections of all higher order 


Vectors i А 
rs in the subspace defined by 5, have lengths which are substan- 
lengths in the higher dimensional space 


This can be stated succinctly as 
annot form an orthonormal com- 
ctors of S form an orthonormal 
f S are members of an orthonor- 
rs are ident ical with the first 
rank of 6 exceeds 4, any data 


и of their | 
гы Ley all vectors of all orders.* 
plete i T he higher order vectors С 
com эя of vectors unless the data ve 
bm plete веб. But when the vectors 0 

complete set, the higher order vecto 


ME vectors. It follows that when the 
basis formed from higher order vectors must be an oblique basis. 


"These considerations imply that if S has rank r, then the coordinates 
of all vectors of all orders on a data basis from 8 will 
contain the greater share of the information contained in the coordi- 
nates on the s(s > 7) vectors of the basis for all vectors of all orders. 
Since the length of the higher order vectors decreases as the order 
increases, a reduction of the problem can be achieved by using only 
first and second order vectors in obtaining à simplest basis.[ А final 


the r vectors of 


order vectors into the subspace defined by S 


the higher 
higher order vectors are so short as to be 


* When the projections of 
ortion, the 


is not a substantial proP' 


negligible. 
in an unpublished paper that triples of occurrences are 


ЇТ. Koopmans has shown i! 
sufficient. to determine 2 structure of latent classes. In latent class analysi 
o the communalities of factor analysis are used. d 


values analogous t 
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В roportions 
reduction of the problem of analysis to one of manageable pr ve көй 
, 

may Бе effected by taking a sample, perhaps a random one, of, 


P nted 
nq of second order vectors and analyzing the 2ng X t augme 
Matrix. 


The analysis of an 


in stages. 
augmented matrix should proceed in stag 
First, the effective ran 


A is is best 
k of S should be determined. This 18 е 
accomplished by extracting principal axes. Given an ae e s 
of r for S, the second order vectors selected may be added to Sa 


"M у 1 ined 
principal axes extracted. The analysis may then proceed as outli 
in Chapter 2. 


For any Particular set of dat 
data basis for 8, the simplest 
simplest basis for the 
Тһе higher order veet 
vectors and higher or 
cosines with the Y y, 


a there is no guarantee that the po 
basis, will be the same basis as 
S matrix augmented with higher order ce 
ors, being in general shorter than the first амла 
der vectors of a preceding order, will have чини 
ector than the vectors of S. Thus the boundary 
be higher order vectors. Usually, however, iu 
> augmented matrix апа for the first € 
matrix should be markedly similar when dimensionality is restricte Я 
to the dimensionality of the vectors of S. By reason of their manner 
of formation the higher order Vectors must have relatively high cosines 
with all of the vectors of S used in their formation. The nearer the 


rank (or the effective rank as determined from the principal axes) 
of S to q, the number of sub-c] 


asses in the classification system, the 
more likely it is that the simplest, basis for S is quite similar to the 
f the augmented matrix. In a Варя? 
* of the information contained in 5. 
ases from Unity or above to less than unity 
9/ (т — п + 1), 8 contains progressively less 
gher order vectors. Thus q/r can serve as à 
nalyse or not as the case may be, the 
higher order vectors. Аз q/r decreases, so does the index of internal 
productivity. It seems, therefore, that as the images of the higher 
order vectors in the subspace defined by 4 represent the higher order 
vectors less and less adequately, analysis of the data itself will prob- 
ably be fruitless because the classification system is unproductive. 


simplest basis for the vectors o 
the expression q/r isa measur 
As the expression decre. 
and to its lower limit of 
information about its hi 


base for a decision to a analyse, 


CHAPTER SIX 


APPLICATIONS 


s of exploratory analyses on 


In thi 
is ehapter we will present the result 
] test, from verbal behavior 


data fr р 
іп ыы different sources: from a menta 
ychotherapy, and from an attitude inventory. 


TH 
E HALSTEAD CATEGORY TEST* 


— study to be reported is from an unpublished analysis of data 
a tered by Simmel and Counts (1957). The 61 subjects of this study 
ginally comprised two samples: 39 patients seen in conneetion with 
а study of the effect of temporal lobectomy as a treatment for psycho- 
Motor epilepsy and 26 student nurses- Both groups were given the 
Halstead Category Test, а test designed to measure abstraction or 
concept formation. The test ha bed by Simmel and 


i s been descri 
ounts (1957, p. 7) as follows: 
of visually presented 


Briefly, it consists of several groups vist 
stimulus patterns. For each group & “principle of correci 


responses" has been defined by the test author and is to be dis- 
covered by the subject. The subjects response 10 each item con- 


sists in pressing selectively one of four numbered keys which is 
: е wq» чо» “g”, or "4". В 
іп fact equivalen › „от “4°. By means 


410 saying 1”, 
of a chime for correct responses and a buzzer for incorrect те- 
sponses the subje appraised of the accuracy oj 


ct 18 continually 
his performance and is, in а sense, rewarded and punished. 


used in the studies reported here. The Halstead Category 


* Communalities were 
Test was analyzed bY pr. Sarah Counts. 
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In comparing the findings on the two groups Simmel and о 
(1957) found that the student nurses made significantly fewer ait 
but that the same pattern of item choices prevailed for both € "e 
This is of import for the current study. Ordinarily a factor rid ган 
technique should not be undertaken with an № as small as 61 pula 
of the possibility of sampling error sufficient to mask, if not mo 
structure. It was concluded, however, that the prior demonst a 1 : 
of the similarity of item distributions for two quite different group 
Justified the analysis with 61 subjects. f the 

Іп this study data from the first sixteen items of Set III o ma 
items in the test were utilized. However, Sets T and IT, се : 
principles used in getting to correct choices are relevant since md 
influenced responses to Set III. Se Each ite 


Set I has eight. items. ipit 
i i ЧИРЕ вс 
consists of one of the first four Roman numerals. 5 responc 


rectly if he pressos the key Corresponding to the same numeral. F Е 
Set П each of the twenty items consists of either one, two, three ч 
four Separately articulated figures. The task for 5 is to count t i» 
igures and press the corresponding key. For 
; the subject is "encouraged"—first to respond to 
numerals and then to count the number of figures. з 

Set ІІ, the set analyzed here, does in fact confront the subjects 


with a new principle. n Set TIT consists of four separate 
figures. Throughout the first sixt 


identical with one differing from the others. The essential task is (0 


at the ordina] Position of the differing figures 
If, then, he “numbers” the positions of the 
right and responds to the position of the odd 
figure, he can consistently give correct answers. The subjects quickly 
note that there are four figures with three alike and one different, and 
i d figure varies over items; many; 
ance of the variation of principle 

leading to correct answers. 


; the classifi 


Since the third set of items numbered sixteen with four choices 
each, and since the subjects numbered 61, the S matrix corresponding 
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to the > 
rid, as 64 Х 61 with а possible rank (ng — n+ 1) of 49. 
r the Y vector was removed, computations were performed on 


а digital c А 
gital computer using а centroid program followed by graphical 
ghest off-diagonal 


Totati " е 

нент from the first residual matrix, with the hi 

WERE за mg used for communality estimates. Only two basis vectors 
' retained because the number of subjects was but 61; and the 


residuals s жүй 
ae seemed sufliciently low for such a small sample. 
he elements of A, the loadings on the simplest basis, and of 


AQ- 
еу ыс сата а normalized to the projections of the ү vector on 
Tabie 14 с asia vectors, are shown in Table 13. QR.Q is shown in 
Nis ago . be the basis vectors are orthogonal, the classes isolated 
н ly exclusive. Note that the solution is unique, since no 
sank об possible without obtaining negative numbers. Also the 
wh erential „(па — n + 1) of 0.04 is 96%, below the maximum 
1e and the index of internal productivity is 1.00. Hence, we 


cone ne 
nclude the system possesses considerable productivity. 
vector Ai is determined by 


ene Table 13 it is clear that basis s i 
only basics ai Almost all correct answers have high loadings, the 
id exceptions being the items near the beginning of the sequence of 
th 8. Secondly, if alternatives ug? or “4” represent correct choices, 

en the alternatives lie close to 41. When “1” and «3” are correct, 
Бе choices tend to have substantial loadings on both A; and Аз. 
Turning now to Аз we find that it is determined by ау” and “3” re- 


Sponses whether correct OT incorrect. It would be а mistake to 
it will be shown that it is better 


characterize Аз as a “wrong” class; : 

characterized as a “counting” class- Finally, it should be noted that 

Most of the incorrect “2” and. “$” ives have loadings on 

neither factor and have, in fact, low 
At this point it appears that there are 

determine loadings on 4: and Аз (or A1Q3 and 4303). 
(a) Right or wrong choic 
(b) Choice or response nu 
(c) Item number (n) р 

The primary determiner of basis vector А: appears to be correct 

choice of the item alternatives. Therefore, we will call this the 

th loadings of 0.40 or higher on 


“correct class". All alternatives wi 
A,(0.58 or higher ой AQT) represent correct responses; conversely 
all alternatives with loadings less than 0.20 (0.27 on 4107) represent 


alternat 


frequencies. 
three characteristics which 


е 
mber (9) 


TABLE 13% 


Loaprxas or RESPONSES 


жж 
ALTERNATIVES ON Овтноховмат, Basis VECTORS 


(ENTRIES оғ MATRIX A) 


Basis Vector А, 
Alternatives 


Basis Vector Аз 
Alternatives 


Hem 1 2 3 д 1 2 3 1 
1 2 0 19 ga 04 00 12 5 
2-10 02 ad o is 05 м 
з 0 0 o п 00 56 05 
4-04-02 5 ә з о 3 10 
5 10 47 в o од 00 47 16 
86-05 00 0 5 Is 04 37 18 
7 80 00 в og 2 00 239 22 
8 0 62 о o ?8 06 31 05 
9 0 00 в o 22 0 36 07 
10 в 66 аи og 38 0 29 0 
11 -00 01 в о 25 08s 39 00 
12 и о 02 o 26 02 39 04 
п о 0 o 65 29 02 39 0i 
м о 67 o 27 10 26 09 
15 о 00 02 67 21 о 46 04 
6 вз 65 ù p 32 08 24 08 

Entries of Ад“ 
1 28 00 2 45 05 0 16 79 
2 B о в п 5 07 5 19 
805 00 5 49 15 00 78 07 
4 06 0 в в 32 0 53 13 
5 4 67 в p 2 o 66 22 
6 08 00 із 80 2 06 52 17 
2: 87 00 05 09 30 00 40 30 
8 02 96 о o 39 ов 43 08 
9 01 00 99 0 31 09 50 10 
10 05 95 00-0 53. 0 30 0 
11 00 02 98 00 35 и 5 

1 54 00 

12 97 00 0% 00 37 T 
- 35 08 54 06 
13 02 00 06 93 40 03 55 02 
4 ^ 0 -0 97 00 8 35 мов 

15 03 0 03 96 29 00 n 
16 04 94 00 00 44 = T 
94 из и 


* Decimal points are omitted. 


** Underlined alternative is the correct alternative, 
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TABLE 14* 
MATRIX оғ PROPORTIONS /^o Library 
ron Елсн CLASS “у È 
QRQ ж pis 
2 
А, А» x 
А, 48 М pe 
A» 52 


* Decimal points are omitted. 


errone 
ous responses; and, after the first two trials, all erroneous Te- 


T have loadings less than 0.20. 
fen е е one minor exception: 014 
is only 4. 1, сһоісе 1), the correc à у 
on A,. erlap of loadings between correct and іпсо 
Ped correctness of response is 
i9 de; ive to have a loading on Аз, 
а place of the item in the sequence 0 

rewarded" by a chime for à correct response ; “punished” by à buzzer 
for an incorrect one. Since the later items have the larger loadings 
on Ај, we can hypothesize that А: is determined by those subjects 
who have learned the principle for correct responses. Thus, although 
A, is determined by correct responses, 16 seems to be determined by 
the correct respotiess of those subjects who have discovered the princi- 
Ple for selecting correct responses. They give intrinsically correct 
responses; Ал represents the behavior of an “intrinsically correct 


т i 
esponse" class of subjects 


(item 1, alternative 4) is greater 
t alternative, on A, This is 
rrect responses 


the necessary condition for an 


the size of the loading is related 
f items. Each subject was 


y correct responses which 
be shown by comparing 
3) with the incidence of 
larly for the latter 


f intrinsicall 


Correct responses. 
half of the items, 2 


Successfully chosen by = N h 
than have items where are correct. Nevertheless, the 


loadings in the latte? half of the test for ар?” and “3” versus “2” and 
“4” correct alterna rly the same on Аз. Note also that 


Correct responses оп ug alternatives are only incidentally 
correct. This is reflected in the substantial loadings found on А» 
аз well as А2. 


4 
í 
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А ; re, 

By the time Set IIT of the test, the set under gr be 
has been reached, there are two classes of subjects. One ipee к” 
grasped the principle of ordinal Positions of the diferent fgur obs 
items and is Tesponding consistently. It is the subject 5 кетеді As 
of performance which actually determines A,. Subjects in They 
class, on the other hand, have not really solved the problem. d their 
give some correct answers but cannot do so consistently an ve 
correct. performance does not contribute to the determination uen 
That is not to вау that the performance of the second class is ii One 
sistent or random for this behavior contributes to the жад quein 
These subjects do, however, increase the incidence of co 
responses for those items for which ua 
Basis vector A» is deter 
counting tendencie. 


and “3” are correct. — 
mined by responses which seems to re vill 
8 оп the part of the subjects. Therefore Аз Y 

nting class” because the 


i llow 
responses appear to fo 
principle used in attainir 


ies to one 
1g correct responses to ith 
ive W 
For instance the alternative ріс 
ects gave this response, a sin 


from the counting 


of the preceding groups of items, 
the highest loading is 014; 38 subj 
counting response. (It had been 


Hence 
; the correct alternativ. Her 
the subjects Were rew 


Principle, the type of 1 ably had received rein- 
i This reinforcement 
rth responses: 033, which 
has the highest frequency 
among the incorrect ourth item. Throughout 
the test the highest lo аз» responses. The incor- 
rect ''3's" are pure on A», whi 
ings оп А; as well. Thus, i 


Stantiated by the “1” 
responses which are high on As, since they represent another counting 
principle: "one different”. Note ther, that when the “1” and “3” 
responses are loaded on both А; and As, the number of correct, re- 
sponses is inflated. Thus, part of the 4, loadings from «1» correct 
and “3” correct alternatives are attained through the use of the wrong 
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princi 

Considerat ји also inflate the As Јова. А 
orthogonal hp QR,Q shows first that the basis vectors are 
counting көде гар ка that the correct principle class, А, and the 
tively, iple class, have proportions 0.48 and 0.52, геврес- 
А al scs of each subject was сартай 
Broup of ес (1, 3, 4, 5, 6, 8 10, ја, 
question al ms (2, Г, 9, 11, 12, 14) and on all items. There 
this all ec я nou including the first item in the 4i group, apart from 
еденді. i ; ect responses have high loadings on the class of intrinsically 
tjus 5 ibiq The Аз group of items is the group for which a 
Rémi nin may have resulted from an intrinsically correct ге- 
н = 94 from a counting type of response The resulting score dis- 
eer 1005 ure shown in Table 15. The distributions for the two 
ihe 2. items over all subjects are markedly different in shape: for 
ine 1 items the distribution of scores is bimodal 8 for the Аг negatively 

wed. For all items the distribution is again bimodal. 
Ps 15 shows the frequency distribution for the 34 subjects 
o7 о did not discover the principle of correct response and for the 

subjects who did. The criterion for selection Was the distribution 
of errors on the ten Ay items, for there was a clear separation between 
those subjects making six oF more errors on the ten items and those 
Who did not. "Table 15 shows rather clearly the two class structure 
of the items but docs not give the detailed information of А07, or of 
its plot. Note that on the g items (2, 7, 9, 11, 12, 14) with 
substantial loadings on Аз the distribution of the subjects using the 
counting principle is essentially normal with à modal score of three 
errors. The group of subjects who demonstrated on the A; items 
that they had grasped the сог 


тесі principle and were able to give 
intrinsically correct responses have a J distribution on the Аз items 
With their modal score at zero 


errors.* 
Other interesting aspects 0 


f these distr 
RUMP T 
* There is one exceptional case. This person. made four errors on A, items and 
five on А, items for & total of nine errors. His protocol shows that in items one 
through eight he made only one err ад; 9 ; Hiowevet. beginning with item 
nine, all responses We? incorrect and were all “1 and 3” responses. The preva- 
lence of these responses suggest that he was counting. 


for the number of errors 
15, 16), on the Аз 
was some 


countin 


ibutions emerge when diffi- 


Frequency DISTRIBUTION оғ 34 Cou 


TABLE 15 


STING SUBJECTS 


AND 27 ESSENTIALLY CORRECT SUBJECTS 


Counting Correct 
No. of Errors Subjects Subjects 
A, Items 10 12 | 
1, 3,4, 5, 6,8, : Es 0 
10, 13, 15, 16 ~ - б 
6 2 0 
5 0 0 
4 0 & 
3 0 2 
2 0 7 
1 0 10 
0 0 = 
S34 827 
A» Items 6 2 : 
5 4 
2, 7, 9, 11, 12, 14 4 0 
3 15 0 
2 1 2 
1 2 9 
0 0 15 
S34 S27 
All Ttems 34 Counting 27 Correct 
Subjects Subjects 
15 2 0 
14 Э 0 
13 9 0 
12 19 0 
11 5 0 
pa 2 0 
У 1 1 
E 0 0 
"i 1 0 
Е 0 1 
5 0 1 
4 0 3 
3 0 6 
1 в 3 
0 с 8 
2 4 
S 34 827 
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are considered. Table 16 
and the counting sub- 
le for making correct 


cult; Р 
саа or percent errors per item 
docti пе results for the intrinsically correct 
` e subjects who discover the princip 
TABLE 16 


Per Cent Errors PER ITEM 


10 A, Items 

6 А, Items 

16 Items 

jon of errors on both 
made on the early 


proport 


probably 
le had been discovered or 
s whose re- 


ude on the basis of 
e four alternatives 


n 

ТА her make approximately the same 

items of ney Most of the errors were І 

before i the test before the correct princip 
it was consistently applied. Гог the 34 subject 


e reflect counting activity, 006 might сопе! 
мет that their answers were random; there ar 
Per item and three-fourths of their т sare incorrect. However, 
Most of their responses (о А items are incorrect, but only half of their 
responses to А» items are incorrect. The latter figure corresponds 
ka За expectation based on A -і The Аз items are those on which 
: or “3” are correct and thes tend to respond only with 
3” or “1”, Under these conditions [ the responses would be 


expected to be correct. 


esponse: 


e subjects 
half о 


SUMMARY 
nd supports the findings of 


vith a 
“іпігіпві- 


consistent W 
These two classes represent 
cally correct” and “counting” response dispositions in two groups of 
subjects. One gro ains subjects giving intrinsically correct 
d their consistently correct answers generates class Ал. 
able to give correct 


As the test proceeds more of the subjects are 
or the later items rise. Тһе second group of 


answers; thus Joadings f : 
rly in the test. Nevertheless 


subjects does not solve the problem eai 
t random, and in fact are closely related to 


The two-class solution is 
Simmel and Counts (1957). 


up conti 


responses an! 


their responses are not. 
certain aspects of the stimulus constellations. The interpretation is 
d with what has been called counting activity. 


that they respon 
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THE PSYCHOTHERAPY VERBAL STYLE STUDY 


The second study to be reported is of quite a different mma 
Here the verbal behavior of persons in psychotherapy was cmd 
in order to: (1) Single out and describe some aspects of clients yo н 
therapy behavior; (2) to test the productivity, internal wr ipii qe 
of the classification System which was based on style of open 
and (3) to generate new hypotheses for further study of the psych 
therapeutic process as reflected in client behavior. - 
Twenty-four completed cases, which had been recorded and br re 
were selected from a larger research project. Cases were selecte 


z А Е Паге cases. 
Provide equal numbers of interviews from success and failure 

Therapist rating on a nine point s 
failure cases, 


rated one to th 
hours of thera 


pproximate proportion to 
the success and failure groups of the ] 


additional group of cases (all long) rated five (equivocally successful) 
was included. Thus the twenty-four cases included five long success 
cases, four short, Success cases, four long partially successful cases, 
three long failure cases, f. 


у . ition 
our short failure cases, and four attriti 
cases (failure cases with f. 


our or less interviews.) Cases with more 

experienced therapists were selected wherever possible within the 
limits of the other requirements, 

The second and nex 


іріп 
their appearance — 
arger population of cases. 


and third thirds of each therapy hour 


- more sub- 
› each response of e 


classes were found. 
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system concerned style 


The : 
three main aspects of the classification 
The aspects and their 


of participati 
sas rather than specific content. 
sses are described below. 


I. " 
LEVEL or Expression 


A. Analysis of Action 
Statements placed here state 
evaluate action, including cogniti 


and analyze, explain ог 
ve action. 


B. Analysis of Feeling 


Statements placed here а 
classify, etc., but focus on feelings as inne 


as classes of behavior. 


ttempt to analyze, explain, 


т experience not 


C. Responsiveness 
This includes expression of inner experi 
images, impulses, etc. They are expresse 
than being described, analysed, ог categorized. 


ences — feelings, 
d directly rather 


IL INVOLVEMENT 
This aspect involves voice quality alone and is judged on energy, 
pitch variation, pace, and pattern of emphasis. 


A. Emotional 

There is high energy in the voice, with a quality of dis- 
charge or overflow. 

inward in à ponder- 


В. Focussed 
the voice is turned 
tain irregularities of 


The high energy in 
There are cer 


ing fashion. 
but little fluctuation 


ing, explor 
in pitch. 


pace and emphasis, 


C. Externalizing 
directed outward with a speech- 


High energy in the voice is y 
making quality- The speech pattern 15 smoother than in 
B above, with emphatic though often mechanical inflections 
and a wide range of pitch 
D. Limited | 
Responses placed here are characterized by low energy 
used in а matter-of-fact fashion. The pace is even and 
tressed. 


relatively 908 
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III. Quarry оғ PARTICIPATION 
А. Participator 


This includes responses in which the client is a participator 
in what he is discussing. 


B. Observer 


The client is an observer of the scenes he is describing. 
For further details on the classification system and its reliability the 
reader is referred to Wagstaff (1959). 

During the course of the study, it was decided to disregard the 
sequence of individual responses and use a within-interview sequence 
comprised of samples of responses from each third of the therapy 
hour. The S matrix was therefore restructured in such a way as 0 
permit discrimination between thirds of interviews, but not to dis- 
criminate between successive responses. However, the fundamental 
form of matrix S was retained. The data matrix S was of order 
nq X twithn = 30, д = 24, and t = 44, n, q, 
tively, responses 
720 X 44 matrix. 


icted to the study of similarities between interviews. 
for analysis was (1/n)S'S 
similarity between the interviews on a scale 
between zero and unity. 

Matrix (1/n)8'8 is Shown in Table 17 - The odd numbered inter- 
views are early interviews, while the following even numbered inter- 
views are the late interviews of the Same case. Thus interview three 


is the second interview of the Second client, while interview four is the 
next to last interview of the same client. 


šuccess cases; 27 through 32, 
long failure cases; 33 through 40, short failure cases; 41 through 44, 


early interviews of attrition cases. 
The matrix (1/n)S'S was reduced by the m 
factor analysis. Two criteria were used for esti i 


of a sample of 150 residuals, and 


of the pro- 
Jections of the unit length vector, S'SY'/V/n VZD on the basis 
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vectors. When the sum of squares of the projections of this vector 
reached the value of 0.97, 150 residuals selected randomly were com- 
puted. The residuals formed a near-symmetric distribution with a 
mean of 0.023 and a median of 0.018. Considering that ¢ was 44, no 
additional basis vectors were selected. Three basis vectors resulted 
from the analysis. 

The three basis vectors had cosines of .24, 18, and .25, and thus 


were not far from orthogonality. The orthogonalization computing 


method described in chapter seven was applied with the results shown 
f the row vectors of S' 


in Table 18 which shows the inner produets o 
with the orthogonal basis vectors approximating the vectors of the 
simple basis. 

With three basis vectors resulting from а matrix whose maximal 
rank was 44, the rank differential r/(ng — 7 + 1) is 0.07, а reduetion 
of 93% from the maximal rank possible for 1 /nS'S. The index of 
productivity was 0.85. The low rank differential and the practically 


orthogonal structure of Ra as represented by the index of productivity 
shows the classification syste ally productive in spite 


m to be intern: 
of the small number of interviews. 


Descriptions of behavior types I, II, and III (which correspond to 


the three factors) were obtained by inspecting the columns of the 
original S matrix representing the behavior of clients whose interviews 
had loadings of 0.40 or above on one factor and .20 or below on the 


others. : 
Type I interviews (Interviews 2, 6, 9, 10) consisted almost entirely 

of responses classified 25 Responsiveness or Analysis of Feeling on the 
first aspect, Emotional or F ocussed on the second aspect, and Partici- 
pating on the third. In terms of the definitions given on p. 77 one 
could say that clients in Type Т interviews are primarily exploring 
feelings, sometimes attempting to analyze them and occasionally 
expressing them directly- Voice quality indicates that they maintain 
an inwardly focussed kind of involvement with occasional emotional 
They talk only about matters in which they are clearly 
gestion that these interviews become 


participators. There is some 5866 à 2 NS 
most intense during the middle third, with an increase in direct ex- 


on of present feelings and an emotional quality present in the 


overflow. 


pressi 


voice. А 
Туре П interviews (Interviews 3, 11, 19, 26, 37, 41) contain 
primarily responses that are classed as Analysis of Action on the first 
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б MATRIX 1/n S’ 
HE PROPORTIONATE ConnESPONDENCE* Between INTERVIEWS 


1 2 
9 € 5& 9 7 8 g$ iy ii 12 13 14 


i, 15 16 17 18 19 20 21 22 
2 30 50 

3 70. 

4 293 07 50 

5 20 10 47 33 

6 40 37 23 20 30 36 

7 33 00 50 40 63 23 

8 23 00 57 60 33 13 47 

t 33 50 10 07 13 43 13 03 5 


көзі 
oo me 
8885 
може 
38638 
юн 

8858 
Hoor 
2888 
ово 
$5955 
oun 
53588 
осе 
5888 
нына 
1538 
оосо 
338 
ныне 
2538 
ыы 
віш 


57 40 63 


"vm 
ea 
Во 
588 
омы 
в88 
әм 
888 
о ым 
$58 
mow 
588 
Rae 
$585 
ооо 
see 
ate 
$865 
өше 
555 
әз 
88 
өз 
58 


17 27 20 37 37 .. 
37 40 30 43 43 30 .. 
33 53 50 27 43 33 27 .. 
38 50 47 43 43 50 30 50 47 .. 
53 20 93 30 20 30 33 17 23 33 .. 
53 20 20 27 20 23 43 17 20 27 60 .. 
13 13 13 23 20 57 17 23 20 40 30 
27 47 43 23 33 30 20 63 40 40 33 
27 23 30 27 30 27 33 23 33 33 30 23 
50 43 47 00 10 30 13 93 40 33 27 27 
33 17 20 23 30 27 50 20 40 33 53 40 
13 23 20 20 23 50 27 30 23 47 57 
20 07 13 07 33 27 68 27 30 17 30 33 
07 17 03 30 37 
23 30 12 30 33 
03 27 18 13 13 27 27 60 20 » à 43 40 
dis ps E: A 20 40 33 60 30 37 23 37 37 
43 37 40 47 30 60 47 40 33 
23 60 20 30 20 40 37 
40 57 37 43 53 33 33 
23 37 30 23 57 40 37 33 
27 40 43 30 60 40 33 30 
33 53 20 37 23 43 30 
20 43 10 20 10 43 30 
50 43 47 57 23 37 30 27 50 47 23 27 
Е 23 40 30 33 37 
44 80 27 50 48 43 27 43 47 20 07 " = a ат x d 30 30 23 23 30 
47 23 40 33 99 47 40 43 50 


ЕГГГГІГЕГЕГГЕГЕЕГЕЕЕ 
ЧЕБЕВВЕВВБЧЕЗРЕНЕНЕЫБ 
3 жы Em epo oeo ceo to o o o eo 
S55588E5ENSSNSDSESNSEEBSS 
" эенезенеюнь 
S8855S8S888885888585858 
& нмазсанмавсанан 
S5g95959555888555859855 
А ы ороны ора ые 
BSESSSISSSSESSINSESaRS 
" E 
SSSSSSSRSESSNSRR88SR88 
өн Se VRB ONenyene 
SSSS8SS8SS8S8aS8N588S8N8 
2 зеппплванлоњо мо 
$888828585323889859558585 
оарвевосченюьяанаююе 
N885585589958588g82ds58Sq 
гоол аыл е з отса 
ВЕЗ5С35 52555558 
E 8222226220222 
8 8888835888535 
ЧУТ ен 
5868 as 85 

Е 

= 

8 

е 

8 

5 

в 

а 

5 


17 07 53 40 53 70 


ARR EEO 
$S5E888 
Co co tn poo) Uo 
2855288 
эннеен 
455885 
новы 
5858685 
омање 
868458 
spuseso 
$£988588 
мю о юю 
581588 
o2souo 
5558358 
wemono 
853853 
ы а 
18655 
ees 
888 

“ 

8 

Lo 

“ 

қ 

“ 

8 

“ 

8 

“ 

E] 

2 

E 


* Decimal points are omitted. 
1 Communalities are gi i i i 
given only for interviews used in Eroups. 
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2 43 44 
23 24 25 26 37 28 20 30 31 32 33 34 35 30 37 38 39 40 41 4 


OOND UN e 


10 40 27 61 

70 47 43 90 .. 
43 50 47 30 53 .. 
67 37 33 03 57 53 70 А 
63 30 23 00 53 43 73 08 3 
30 ат 20 57 47 33 17 20 >. 
70 43 40 13 70 57 


57 43 33 10 57 43 57 50 1547 ue 
50 63 57 37 60 53 40 30 40 S aris 

77 40 Бо 00 63 50 70 00 27 17 ay 50 43 = 36 
40 47 40 20 43 47 40 23 30 23 20 47 13 40 70 37 
20 43 27 63 38 38 20 10 599 75 41 то 47 80 83 < 38 
So аз ag 43 60 Б 48 30 D TD sg 50 00 37 00. 39 
67 43 37 23 70 50 Б a Es өз 47 40 57 27 23 40 67 id қ P 
73 33 40 10 63 40 E 10 60 23 20 47 із 37 70 47 27 23 55 m 
20 40 13 63 27 30 2 53 33 00 67 57 73 43 27 47 47 43 " = 
67 43 40 17 57 43 60 40 33 53 57 40 67 33 23 40 47 47 23 67 E 

60 33 43 10 2 2 = әт 43 37 37 63 30 43 47 53 37 37 50 40 23 .. 44 
37 47 30 43 4 
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TABLE 18 


ORTHOGONAL APPROXIMATION TO OBLIQUE STRUCTURE 
(DATA oF TABLE 18) 


| ен» I n III Interviews І II IH 
i. 48 14 35] 24... . 45 -50 35 
2. 69 — 06 —041| 25... . Ag y — 42 
8. Jo 32 46| 26... 12 78 -03 
4. 10 — 48 е|ј27.... 10 38 70 
5. 30 — 35 — 45| 28. . 10 38 50 
6. 56 13 лај 99... . 15 ло 8 
7. 30 — 30 67| 39... 090 59 
8. 90 38  .6|3.... 0  ;2 19 
9.... 72 05 от] 32... в 79 
10.... 56 -03 —93|33.... s 16 6 
Boos № в ји. ar нв 4 
18... . 49 45 00] а i5 06 7 
В... 49  .& e| 85... do ш м 
14.... 238 лв 00| 37... 342 в 12 
18... . 49 16 22| 38.... 16 .53 42 
16.... 52 .34 — 46| gg 10 28 75 
Crass МЕ oz и... ме м м 
We... 46 шора 00) 10 80 12 
ewes MD NE IP WDR" 35 — 64 
2.... M — 45 [ав 49 — 16 — 
Blum a i 40 38| 44... 99 .55 ЗІ 
Pessu A Al 34 | LV. Аз 59 66 
2532... 7 АБ — | 


aspect, Limited on the second aspect, and Participating on the third. 


sis of Feeling response. In the final 


Ба ро! е evenly divided on voice quality 
among Limited, Externalizing, and Focussed. Translated into the 


gs and never expressing them 
ghout the first two-thirds of the 


of involvement in what they are 
Te 15 an increase 


with an outward 


saying. During the last third the 


~ Р : of energy in the 
Voice, either focussed inward or 


; Speech-making 
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qualit li 
«du 5 е 
У. nearly all the responses they are discussing matters in 


"А they are participators. 
go e e 23, 29, 30, 32, 35, 40, 43) con- 
iram Jodi» : | a c assed as Analysis of Action, with 
Gn баса 5 : of Feeling response 1n each third of the interview. 
анти ~“ ity aspect, Involvement, all but two responses were 
мън ЧЕ, the other two being Limited. On the third aspect, 
ja сея e evenly divided between Participating and Observing. 
ње m the definitions on р. 77 one could say that clients in 
ме че Pacem are primarily concentrating on describing and 
ign p ^ eir own behavior, with rare attempts at analyzing feel- 
conned е uS are never expressed directly: There is a good deal of 
ibakin 5 - voice, but it is used in an outwardly directed, speech- 
учи ashion. Occasionally the voice takes on an evener, matter- 

BN ‚ quality, but it is never focussed inward, nor is there any expres- 
sion of emotion. About half the time they are discussing events in 
which they are participators but the rest of the time they are merely 
observers of the scene. There are no time trends for thirds of inter- 
views; the client's style of participation does not change as the inter- 


view progresses. 
In assessing external pro 
of significance or better that: 


5 


ductivity it was found at the 0.05 level 


s cases exhibited more 


jews of succes 
rviews of failure cases. 


1. Both early and late interv 
and later inte 


type I behavior than early 
avior was more characteristic of the late interviews 


f five on the nine-point success scale than 
r-cut successes or failures. 


2. Type П beh 
of cases with ratings 0 


of late interviews of cases rated as clea 


te interviews of failure cases showed more 


han did the success cases. 

apists exhibited more type I 
type ПТ behavior in late 
erienced therapists. 


3. Both early and la 
type ПІ behavior t 
erienced ther 


4. Clients of more exp 
erviews and less 


behavior in early int 
interviews than did clients of less exp 
early interviews correlated .46 (rank order) 
with favorable changes in self-description on Q-sorts by clients 
from pretherapy to posttherapy- The same behavior corre- 
lated .47 with decrease on Taylor Anxiety Scale scores. 


5. Туре І behavior in 
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6. Type I behavior in late interviews correlated .39 with post- 
therapy scores on the Barron Ego-strength scale. ih 

7. Interviews showing high type II behavior correlated .42 wi 
client evaluation of outcome. 


TABLE 19* 


MATRIX Ro 


POLITICAL ATTITUDES 
[D T£ 


? 8 4 5 6 7 s о 10 qn 1 13 M 15 
1 75 0 0) ва по æ 38 6 08 18 4 2 
2 12 00 04 07 01 02 07 03 00 08 04 00 10 a 
3 H 02 01 08 04 00 07 02 07 02 04 05 d 
4 41 00 00 18 10 13 11 25 05 08 25 3 
5 39 00 07 20 12 05 29 05 09 27 n 
6 30 06 02 12 04 11 05 05 12 ~ 
7 31 00 00 06 19 06 06 21 0 
8 32 00 05 22 05 07 20 05 
9 37 09 24 04 09 23 05 
10 20 00 00 08 09 03 
11 65 00 11 51 03 
12 15 03 04 08 
13 22 00 00 
14 64 00 
15 14 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 
26 
27 
28 
29 


* Decimal points are omitted. 
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nd III behavior correlated 
frequency of voice quality 
ch were judged on an 


8. Rank of loadings on types I, П, a 
+.60, +.04, and —.60 with rank of 
categories of therapist response whi 
a priori basis to be therapeutically optimal. 


о 
= 
to 
~ 
m 
52) 
о 
e 
о 
о 
о 
15) 
O0 -1 CO C1 O0 t9 н 


02 04 08 02 04 08 03 06 05 


16 11 02 25 06 25 


24 02 13 09 18 


* Decimal points are omitted. 
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THE POLITICAL ATTITUDES STUDY 


Our colleagues, Professors David Easton and Robert D. Hess, col- 
lected as part of a larger study, data on the ideal political attitudes 
of high school adolescents, The questionnaire used had items of the 
form, “A United States senator is personally very much in favor of а 
bill which the leaders of his party do not want passed. He should 
(а) vote for the ЫШ, (b) vote against the bill, (c) not vote at all. 
The items could be construed as requiring the student to choose 
between two groups of individuals or influences or not to choose at all. 
From the items, a group of ten judged to be the least ambiguous were 
selected by Michael Rogin for an exploratory analysis (unpublished). 
Nine of the ten items had three alternatives; one had but two. The 
items were coded with respect to the alternatives with Roman numer- 
als to the response alternatives as follows: І- (1) senator, (2) party; 
(3) neutral; I- (4) senator, (5) people, (6) neutral; II- (7) ex- 
perts, (8) people, (9) neutral; ТУ- (10) friends, (11) senator, 
(12) neutral; V- (13) president, (14) senator, (15) neutral; VI- 
(16) president, (17) senator, (18) neutral; VII- (19) pressure, (20) 
people, (21) neutral; VIII- (22) advertising campaign convincing 
Voters, (23) Senator, (24) neutral; ІХ- (25) stay on the job, (20) 
go home and defend self against attack; X- (27) senator, (28) 
neutral, (29) party. 

The principle object of the research was to discover the number 
and nature of different бурев of ideal images of the manner in which 
politica] decisions Should be made. 

From the 600 respondents, 100 were selected at random. The 
Ro matrix of joint proportions for tl 
Table 19, Inspection of the matrix revealed that the sums across 
columns of vectors 4, 9, and 17 gave a column vector with entries 
Proportional to those of the first centroid vector except for the values 
corresponding to the items for which vectors 4, 9, and 17 were alter- 
natives. These vectors, with corrections underlined, are given in 
Table 20. Тһе approximately orthogonal basis corresponding to the 


oblique basis (vectors 4, 9, and 17) and the projections thereon are 
shown in Table 21. 


That the internal productivit 
was high is shown by the rank d 


he 29 alternatives is shown in 


y of the classification system (items) 
ifferential of :18, low compared to its 
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maxi 79 : : = 
imum value of unity. The index of internal productivity, not 


в. is obviously above 0.80. 

he interpretation of the data analysis whic 

unpublished study by Michael Rogin. 
ased on Table 21. 


h follows is taken from 
His interpretation is 


TABLE 21" 
MATRIX, F, OF LOADINGS 
ON APPROXIMATE ORTHOGONAL BASIS 


TABLE 20* 
Maux А 
OF PERPEN 
SNDICULAR PROJECTIONS 


On Basis VECTORS тн 4 1 Ш Ш 

1° 56 37 44 

1 35 37 I 2 06 03 10 
2004 03 3 -02 16 04 
3 02 07 4 43 15 13 
4 23 13 II 5 10 14 44 
5 п 2 6 06 27 01 
6 07 12 7 35 02 17 
т B 0 ш 8 10 19 28 
8 10 12 9 14 36 14 
9 13 18 10 14 13 18 
09 ІУ 11 a3 37 44 


10 11 09 
12 os 06 05 
13 10 16 15 
у 14 34 33 46 
15 15 07 03 
16 19 0 08 


17 13 15 б 


16 п 07 VI 
Sois B 18 25 32 06 
= в а) 19 20 06 08 
i9 ü 00 уп 30 28 20 46 
| 1j 21 09 30 04 
or d ДУ 22 o7 00 09 
2 ow 02 уш 23 41 35 50 
o 9 № 24 n 21 00 
а 0 1 yc 825 32 47 40 
25 % B 26 26 09 19 
27 55 и 4 


26 15 09 
27 36 29 


28 04 
29 01 01 


* Decimal points аге omitted. 


x 28 03 п 08 
29 о 0 01 
Y 58 57 58 


* Decimal points are omitted. 
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“The most significant loadings, high and more than double the 
others, on the first basis vector are 4, 7, 16, and 19. Alternative 15 
although having a low loading is pure on this basis vector. Alterna- 
tives 4, 7, and 16 suggest that the first type is characterized by choice 
against the people wherever that choice is permitted by the question- 
naire. Alternative 18 Suggests a refusal to choose bet ween the senator 
and the president. It seems that the type represents a choice of 
authority figures over the people.” В 

“The third basis vector (high and pure loadings on 5 and 17) Ја 
characterized by the choice of the senator or the people over the presi- 
dent. This suggests that, associated with the ideal of popular 
sovereignty, which is clearly the Базе characteristic of this typer 15 
the ideal of the rational man, free from pressure, who makes up his 
mind on the basis of the issues. It should be noted that neutral 
alternatives tend not to have loadings on the third basis vector. The 


loading on this basis vector dominantly 


“Тһе pure loadings on the second basis vector (6, 9, 21, and 24) 
people and authority. These 
possible choices. This result 


at ‘mass politics’ results in dislike of or 
233’ machine politics and ‘rational popular 
sovereignty’.”” 
“The data Suggest that there is a continuum with popular sover- 
eignty at one extreme (as an ideal) and authority at the other. The 
Senator represents for respondents at both extremes, the independent 
rational man acting on the basis of the issues on the one hand and 
authority on the other. The senator is perceived by one group as the 
bulwark of authority and by the other as the bulwark of democracy. 
Thus the interesting fact that the senator alternative is chosen more 
often than one would expect whenever the choice is ambiguous be- 
tween authority and democracy, as often it must be. Although these 
respondents were adolescent students, the data suggest why the sena- 
tor has so much prestige and influence in our society. His image is 
congenial to groups whose images of each other are negative,” 
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TABLE 22* 
Матвтх FQ 
Пет Alt. I Il ІП 
1 96 65 76 
I 2 10 07 17 
3 -04 28 07 
4 74 26 22 
II 5 17 25 76 
6 10 47 02 
7 60 03 29 
ш 8 17 33 48 
9 24 63 24 
10 24 23 22 
1У 1 57 65 76 
12 14 10 09 
13 17 28 20 
v 14 59 58 79 
15 s» № 9% 
16 з 16 и 
ҮІ 17 22 26 16 
18 43 56 10 
19 35 10 14 
VII 20 8 35 0 
21 в 55 07 
22 12 00 n 
vun — 2 а * x 
24 i9 3 0 
х 2 5 8 Ш 
26 45 16 88 
27 95 72 BL 
x За 05 19 14 
29 o 02 02 
Y 1.00 100 1.00 


* Decimal points are omitted. 
TABLE 23 


MATRIX; QQ, OF Crass PROPORTIONS 
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COMMENTS ON APPLICATIONS 


Of the applications of exploratory analysis offered, only the analysis 
of psychotherapeutie interviews exemplifies the naturalistic a 
tion of free, uncontrolled behavior. Subjects taking the Halstea 

test were limited to answering one of four possible alternatives. i 
answers were rewarded by a chime and punished by a buzer ди 
correct and incorrect responses, respectively, a learning situation 
with limited and standardized response alternatives was being studied. 
A closer analysis of the coordinates of the data vectors would possibly 
have revealed some effects of alternating reinforcement on the differ- 
ent alternatives. The usual factor analytic procedure, based on 1 
16 X 16 matrix, because of dichotomous scoring, would have revealec 
the growing ability of the subjects to grasp the prineiple underlying 
Correct responses. In the present analysis the first factor WOR, 
detected as a simplex and it appears that the method of analysis here 
used may identify sub-matrices of simplexes in data matrices. The 
usual factor analysis would not have revealed systematic pattern т 
the wrong answers for the simple reason that the wrong answers 
would have been condensed by implication at least into a “not-cor- 
rect” class. The difference in type of result to be expected is due 
entirely to the fact that in conventional f: 
focus is upon the correct or “positive” 
dichotomous case, eliminating all re 
among a specially designated sub-cl 
dichotomous case, 
whatever informat 


‘actor analysis of items, the 
responses; thus, except in the 
gularities in response except 
ass of behavior. Even in the 
the usual factor analytic procedure does not use 
ion is contained in the first principal axis of the 
total data matrix which shows explicitly the classes of responses 
collectively designated as incorrect, 

The use of factor analysis in the study of attitude items requires а 
judgment that a certain sub-class of response is "ideal," “optimal,” 
"good," ete., such norms usually being collectively designated as 
positive. No study has yet been conducted in the knowledge of the 
writers in which the sub-collection of Positive responses was desig- 
nated as positive by resort to a table of random numbers although it 
would be possible to do so. While normative decisions as to what 
sub-class of responses to analyze cannot be criticized as such, these 
decisions to restrict analysis to 1 /q of q categories of response might 
be viewed as one of the possible outcomes of analysis rather than as 
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preliminary to analysis. After all the classification system itself is 
In part a product of normative decisions; it seems desirable to mini- 
mize such decisions to the extent possible because such decisions are 
risky. The greater the number of classes in the classification system 
the more risk is attached to selecting for analysis just one of the 
classes. 

In the study of psychotherapeutie interviews the classification 
system contained twenty-four main sub-classes. The usual factor 
analytie procedure would have required some decision as to which 
one of the twenty-four eategories represented, for example, the optimal 
class of response, the remaining twenty-three categories being rele- 
gated to the non-optimal class for purposes of analysis. The result- 
ing data matrix would then have had but twenty-four rows, had a row 
analysis of the data matrix been done. But the entire classification 
system itself was based on the investigator's opinions as to what was 
optimal or non-optimal and the sub-categories represented judgments 
of various levels of optimality. One main purpose of the analysis 
Was to ascertain from the relations among responses, categories, and 
criteria those sub-classes of behavior which were optimal and non- 


optimal. 


APPENDIX А: TESTS OF 
SIGNIFICANCE FOR THE 
NUMBER OF BASIS VECTORS 


NER РА ; of 
Іп some situations it is to ђе expected that for a given set 0 


Tesponses, n, and for a given number of subjects, ё, no sequence 

effects will be observed or are expected. This situation holds = 

some types of conversations, achievement and aptitude tests, an 

Personality and attitude inventories. When such is the case the 
n 


Summary matrix У 5; may be analyzed. There are n matrices Si 
1 

each of the S; being q X t. 

of the q sub 

@- )( – 


n e" 
The row vectors of Z S; are the centroids 
1 : 
2 with 
-classes of the classification system. The total x^ w1 
1) degrees of freedom is 
xt = nt(s — 1) (9 
here n is the number of responses, 
the sum of the diagonal elements of 


Ж t the number of subjects, and s 18 
п а 0) 
О = 1/n (25:58 D+ w 
% а 
With D being the diagonal matrix whose elements are the column 
(or row) sums of (3 82 st) and with W being the ах q matrix 
1 1 


with elements 1/4, 4 being the number of categories in the classifi- 


93 


APPENDIX А 


cations s; 
ystem. The value of x3, associated with the second latent 


root, is given by 


z [pram Ба врие 
1 1 
(8) 


xs/nt 
хрх! 


where а i 
vis ^ f i 
s the second latent vector expressed as а row matrix. Denot- 


in 2 i 2 2 
g the correlation ratio by n» (3) becomes 


2 


ж = пт 


(4) 
is the second latent root of 


Besides bei А А 
esides being а correlation ratio 72 
ditional latent root 


D-12 Ш п 
(1/n) 2 5; 2 S: D^. Upon taking each ad 


The partition of x? is shown below 


t 7 
Wo degrees of freedom are lost. 


PARTITION OF x 


x 


Roots dif. 
т @-)+¢-0 nts 
Я à-1)4 фер) == піт 


(by subtraction) 


Remaining Roots (by subtraction) 
nt(s — 1) 


Total (q— 00-1) 


are significant there rem: 
latent vectors may be extracted. 


When the remaining roots ain additional 


significant dimensions and other 
However, the per cent of the total accounted for may, at times, be so 


high as to make extraction of additional vectors unprofitable. 
When 7} is the only significant correlation ratio, one dimension 


accounts for the data under the hypothesis that there is an underlying 


scale. Then taking 

a/m — 6 2 S; (5) 
elements are the elements of the 
g the entries of (5) to zero mean 
res) gives a set of weights for 
Jassification system which best discriminates 
among subjects in the sense of least squares. For r latent vectors, 
there will be (r — 1) sets of scores, the first latent vector, the trivial 
vector, not being used. ‘The scores for each subject are obtained by 


summing down the columns of (5). 


matrix whose 
d transformin 
to standard sco 


Where G is a diagonal 
second latent vector àn 
and unit variance (e+ 
each category of the ¢ 
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et 
The scores being optimal in the sense of least squares, no мр 
of weights will be better than the weights of (5). мака 9 iem 
sets of weights may be as good or almost so. In particular M ме 
for categories derived from orthogonal bases approximating simp. 


à E ee à res for 
bases can be expected to provide effective w eights and scor 
subjects. 


22 У Ge ve 
The number of significant latent roots plus one of 2 S; can Ser 


; n Б w- 
as an estimate of the rank of S albeit а very conservative one. Но 
Д 


ever, опе сап be sure that if X 
1 
S has rank of at least r = 


A detailed consideration of tes 
given by Bock (1956). 


S; has г significant latent roots, then 


roots 15 
ts of significance for latent roots } 


APPENDIX Б: РВООЕ ТНАТ ТНЕ 


SEQUENCE OF MATRICES 
Ra Rises +> Rn CONVERGES 


TO AN IDENTITY MATRIX 


L. HARMON HOOK 


UNIVERSITY OF CHICAGO 


е matrix whose 


metric positive definit 
it basis vectors, 


Suppose we are given a sym 
f the appropriate un 


elements аге the inner products 0 
Le., 
(а, са) (ап og) ««« (at an) 


(оз) ой. 


\ 


Ra 
(ат, 68) (an 02) -+> (олу ба) 
lization process or equivalently 


By the Gram-Schmidt orthogona 
by the diagonal factoring method, we may obtain an orthonormal 


defined ав follows: 


са lll 


basis єр, € + + o €» 


a= 
e = [ог — (2» e)al/|laz — (9 «да! 
аме ӨНЕГЕ. = — 

e; = les — $ (о) 6) в/в; =a (а; еде |. 


i=l 
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We also have, 


а = (ап, ala + 0-6 +... + 0-е, 
аз = (оз &)& + (оз, е)е + O-e +... + 0-е, 


аһ = (an, ale +... "Fs, бијен. 


It is now possible to find unit length vectors ài, à», . . ., б, such that 
C;20, 2j 
буа) = ү? 57 
виа) = [c 45 
for %і1,2,... тп. Clearly, we have 


9i = (61, ale + (à, ©) +... + (б, ве, 
ё = (6, а)а + (5, де +... + (52, ел)ен 


9, = (ôn, еја + (ôn, ©) +... + (ôn елден. 
Let YS ats,7=1,2,... n; 


and let y; = У, ће. y: 
is y; normalized. 


We are interested in the inner product 
(ә v3 = Gs уда] lyi 
= (a; + ôi а, + бр Ла, + 5. - о; + 5] - 


For i = j, this inner product is Seen to always be unity. In what 
follows, assume that 4 >< j. Then 


; 
15 a unit vector in the direction of y; or 


(voy) = 


1 
lor: + 4 | - је; + 5, ЕЕ did (e, 8) + (a5) + (ès | à 
but (а, 8; = Ofori == j by construction, so if 


ке 1 
las + 241-6 Fag’ 


we have 


(vs ү) = Kilas, а) + (8;, 5;)]. 
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It 

"s rn е observed that the vectors yy Ys + Yn constitute a 

лен кіна те и for which ап, ав ++ „ An ATE а basis. Thus 

ieri wn e | a TAREE В: similar to В. in the sense that it isa sym- 
positive definite matrix whose elements are the inner products 


of Si, 
the basis vectors yu vs << 5 Те VIZ 


(vy т) Qv 79) + Qm Yn) 
" (уз, v1) (уз Y) + (уз, 7») 
= 4 


Qr Y) (Ym v2 ++ (vs, ТЈ. 


Since 

К(ов ai) + (ә 8j] 

e two matrices. In particular 
y, is in general а basis closer 
a, Again, if 


Jace of Ra, we should obtain 


ng a basis for the space of 
the property that it is a 


(vi тд = 


there is some relationship between thes 
if (0, 6,) < 0, then the basis Yn 79559 
to being an orthogonal basis than the ba 
the above steps are repeated using Rin p! 
new vectors On, 97) ++ » азу, constituti 
Which oj, o», .. +) @ 88 basis and having 
more nearly orthogonal basis than that of т» 

The remaining arguments will indicate tha 
steps will indeed lead to 2 unique orthonormal basi 
as that for which a, 0% += 2" is a basis. Further properties 0 


uniquely determined basis will be examined later. 
Returning to а more exact discussion, We have the following 


hypotheses: 


sis an 90) ^ ^ 


әз. Yn 

+ repetition of the above 
s for the same space 
f this 


(ағай Z 0 for all? and 7 
Ж . [G2 aizi 
м a) = | 19}, 
see what restrictions these hypotheses impose on 
тапа j let 
|. отићи) 


and we want to 
(5,5). Fora particular 

M = lak = 
is consists 
and j selected above. 


of all the a1, a; +++) 0% except a; 


hose bas 
By construction: 


ticular ї 
6, L M and ô; L M. 


be the subspace W 
and a; for the par 
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In conjunction with hypothesis Ж, this implies that 


9i L (M, oj] and à; L (M, a]. 
Thus, when à; |. 8; we must have 


ief M, a] 


ino 4 Я Ав 
ав 6; 1 (M, ог) and аз (8; а, М) generate the entire space. " 
А : я 1 
ô: L M, 6; is а scalar multiple of a; However, since both are ur 
vectors and chosen in the same direction 


д; = a. 
Ву symmetry, a similar argument is valid for 6;, and we have 
j = oj. 
Thus, (ауа) = (85, 6 
The converse of th 
tion of the ё; Thus 
This implies th: 
and oj, we sh 


j = 0 when it is supposed that (б ô) = 0. 

е above argument is trivial due to the construc- 
; We conclude that а; L а; if and only if 6; L 5 
at if we originally have orthogonal basis vectors а! 
all already obtain orthogonal basis vectors y; and 77 


because 
Gs v) = ККаь о) + (6, 85)] 
= К.0 
= 0. 
Having dispensed with this case, we may change our hypotheses ж 
to read 
(а, а) > 0 for all i and j 
С;> 0,5 =7 
MICE [o e" 
U 


sing the same notation as above, let us treat the case 
Case 1: a; L M and а; 1 М. 


In this case œ; аҙ бг and 6; all lie 
all in one plane. 


Project a; into |а, i.e., the subspace of the 2-space orthogonal 
to the subspace generated by аҙ and call this Projection 6;; similarly, 
project а; into |а! and call this Projection 6), 

Construct 6; such that 


in the same 2-space, i.e., they are 


8: = ot |o: 
i.e., ô: is colinear with 6; and is of unit len 


Р | gth. Similarly, construct 6, 
colinear with à; and of unit length. 
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Now let 
and е ајде 
200 ПМ 
ала y; = у. 


Now consider 
(ет) = (ai бо 9 4-8) 
= (оу aj) + (6; 69 
= 1011-1691 cos? 
+ [l:l] lô; 1l сов % 
= совб + cos $ 


ngth, where the angle 0 is the 


angle between 5, and б; 


beeaus 

ause a; аҙ бг and 4; are all of unit ler 
2 

and ¢ is the 


деше: 
acute angle between o; and ај 


Referring to the diagram; we have 
ф=т-@ 


ала совф = - 050. 


| 
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Thus, Gs Y) = 0 = (ai, а) + (8,6) 
and (5,8) = — (а, а) 
ог 


(858) < 0 as (а, а) > 0 by hypothesis. 
Let us now consider the following less restrictive case 
Case 2: o; L M and аз по | M. 


Since (ауб) = С; > 0 


(а; 8) = 0, 


we can write 6;as а linear combination of a; and ô 


and 


у; ме, 
6: = аа: + a; 
= с; = а (оа, а) + а (о, 8) 


= ai, 


апа віпсе (а), 6) 


we have a; > 0. Also, 


(6; 6) = ailan ô) + а,б), 6) 
= аҙ. 


Supposea; > 0. Then (а, 6:) = Oimplies that 


(aj, а-о: + а;:5) = ailai, о) + аа, 0j) 
= 0. 
Thus, since (03, $) > 0, (0а) > 0, and a; > 0, we must have 
a; < 0 which is a contradiction. We must then conclude that 


а; = (5,5) < 0 
ав before. 


The most general case is 
Case 3: a;not 1 M and a;not | M. 


Let P be the projection into {655} along M, i.e., the orthogonal 
projection onto the subspace generated by à; and àj. Let Г be the 
identity operator. Thus, for үеМ,Рү = д and for any vector B, 

В = РВ + (I — pg, 
where РВ is in the 2-space {6;, ô;} and (I — P)Bisin M. 
In particular, (o; 6;) = 0 implies that 
(a 8) = (Pes 8) +10 — Ра, 53 
= (Ро, 0j) 
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since 5, L Mand (I — P)asisin М. Thus (a; 6;) = 0 implies that 
(Ро) 8) = 0. 

We may now apply Case 2 to Pa а Ts 
show that 


Similar arguments 


à; = ђ,Ро + 56; 
ала (ô; Ра) = Ра» Род + 5/6» ра) 
= (Раз, Ра). 
Also, 
0 < C, = бре) = (à; Pai + g- P)a:) 
= (6, Ра) + Qs (I — Р)о) 
= (б, Ра) 


since (I — Р)а; is in M and 6; 
0«C;- b, (Pas Ра); 
have b; > 0. Thus, Case 2 is 


1 M. Hence, 


and since (Pas, Рад > 0, We must. 
applicable and once again we conclude that 
(ôi 6 ) < 0. 
We have just shown that in all possible cases (6,5) <0. Hence, 
since 
(vis 2) = Kl(as a) + б» 59. 


we must have 
(уә) € (ai, а) 


as К > 0. Also 
o € Ira 791 < (ағ 22). 


APPENDIX с: THE SPECIAL CASE 
OF THE DICHOTOMY 


:t has been 
Тһе ease of the dichotomy is one of special interest because it т 
treated extensively by Lazarsfeld (1957), Green (1953), Gibson py 
and others in connection with latent class analysis (Lazarsfeld, 195 


S а ЧА ous 
There are two formally similar but different cases of the dichotom 
classification. In one case the els 


categories. In the other the classifica 


two categories but the investigator reduces the system to a Ы еліні 
by deciding to classify one of the sub-categories as the “correct” o! 
“positive” or “desirable” or the “most interesting” category. Such 
concentration on a single category is equivalent to selecting an 
n X Lsub-matrix corresponding to a single response. In other words 
a sub-set of the data is selected for analysis, 

In latent class analysis it is postul. 
jects сап be divided into r clas; 
being that for the subjects in 
mous items are st, 


MS vst Ew sib: 
fication System has just two ез a 
i re tha 

ition system may have more th 


ated that a population of suh- 
ses, the key characteristic of the r classes 
each class the responses to the n dichoto- 


atistically independent, Thus the most important 
assumption is that the probability of 


а set of responses of a subject 
belonging to one of the r classes is the 


product of the probabilities of 
the responses to cach item of the Set. The е 


quations of latent class 
analysis may be written as follows: 


Тер Фри + pin +... + %, 
Pi = Dipti + рири: +... + ур; 


Du = pipupi; + pupugn; +... + Рурмру; 
Pot = ppypipu + pupuquupn, +... + Арх ль 


| 
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Баала оп the left are empirical proportions and 
ai 2. ns are theoretical. These are T, ША latent 
iin E XR expressed in па НЕ notation all cells with recurring 

scripts represent theoretical values; those with non-recurring 
subscripts, empirical. 

The matrices of empirical joint proportions of whatever order 
can be derived from either the reduced n X € sub-matrix or from the 
full dichotomized S matrix. When the full 2n хі dichotomized 
S matrix is used the matrix of joint proportions corresponding to pairs 
contains the empirical proportions Риш the diagonals and zero in the 
off-diagonal cells of the principal sub-matrices corresponding to the ni- 


І s ins 
n latent class analysis other values, analogous to the communalities 
These values then limit the entries 


trices of higher order joint propor- 
In a matrix of propor- 


of factor analysis must be used. 
appearing in the diagonals of the ma 
tions and in certain of the off-diagonal cells. 
tions corresponding to the triples, the general clement is Рик but 


some elements with recurring subscripts Piit ete., will appear. Such 
1 unless the data are treated asymmetri- 


elements must be estimatec 

cally. At the present stage of development. of the subject this in- 

volves treating only part of the data. Thus when data are dichotom- 
”), there is a selection 


ized (rather than being “naturally dichotomous 
among the data vectors which disregards information by selecting n 


from among nq data vectors; then à further selection must be made 
in order to avoid estimating unknown values. On the other hand, 
if all of the data are to be used, then far more than the ng values to be 
substituted in the diagonals of Ру must be estimated. Finally, as in 
faetor analysis using communalities, the statistical rank of Ro with 
communalities in the diagonals must be estimated. 

In view of the fact that the data of S matrices can be changed into 
dichotomous classes; latent class analysis can be considered to be an 
ve to the analysis of empirical S matrices. It seems, there 
ean analysis of an Ко mat rix for which the 


ompar 
is known with a latent class analysis of the 


alternati 
fore, appropriate toc 
true latent structure 
same data. 

Table 1 shows à 
non-empirical values, 1 


n Ro matrix for data with three latent classes. The 
ecessary for latent class analysis, are under- 
lined. The true Jatent structure is shown in Tables 2 and 3. Тһе 


structure arrived at by analyzing Ro only with empirical values in the 
diagonals and in the underlined off-diagonal entries is shown in 


"Puo are SJUA WUN + 
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renew 5. a solution of Table 5 was arrived at by taking the 
data i ане is identifying the vectors of the oblique simplest 
parison Ke E v? а une on the oblique simplest basis. Com- 
soweit ойн structures shows clearly their essential similarity 
an i E differ. The degree of similarity probably could 
плав Е b further rotation. The number of basis vectors 
imei i јан the principal axes 18 three and the correspondingly 
corsi " ROW vectors have the same patterns of high and low inner 
ma Гог purposes of hypothesis formation and inference it 

em that the two approaches give similar structures for 


dichotomi 5 
hotomized or dichotomous data. 
TABLE 2 TABLE 4% 


STRUCTURE DERIVED From Ro 


LATENT Strucrure* L 


* Decimal points omitted. 


TABLE 5* 


* Decimal points omitted. 


TABLE 3 


*N 


LATENT STRUCTURE 


* Decimal points omitted. 


* Decimal points omitted. 


APPENDIX D: ANALYSIS OF 
WEIGHTED DATA VECTORS 


е "ueture 
In Chapter II the conjecture was advanced that if simple structu 


basis vectors Possess factorial invariance, then the matrix of principal 
axes M, of weighted data vectors would also be a factorially invariant 
matrix and would be more nearly invariant than the simple structure 
factor matrix. The basic reason for the hypothesis of more stability 
or invariance of factorial structure was that the principal axes are 
uniquely determined by all of the data vectors. 

Tables 1 to 7 show various factor Matrices for a set of body 
measurements used by Thurstone (1946) for teaching purposes be- 
cause the data showed such a clear and interesting simple structure. 


~ 1 я ighest 
The matrix of correlations Was essentially non-negative, the highes 
negative correlation being —0.05; 


suitable for discussion here, 

data vectors on the varimax $ 
solution is presented ins 
the factor pattern for the 
oblique solution were ide 
of factor pattern, a fact 


hence the correlation matrix is 
Table 1 shows cosines of unweighted 
imple structure factors. The varimax 
tead of Thurstone’s ob 


lique solution because 
orthogonal y 


arimax solution and Thurstone's 
ntical using Thurst one’s criterion for identity 
or loading of no less than 0.35, with loadings 
between 0.30 and 0.35 being considered of doubtful significance and 
loadings below 0.30 being considered non-significant. 


Table 5 shows matrix M, th f coordinates on the principal 
axes of the we ble 6 shows the cosines of the 
weighted al axes of the weighted data 
vectors. as the highest weighted invari- 


e matrix о 
ighted data vectors and Ta 
data vectors on the princip 
Variable 9, head breadth, h 
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ance among the weighted data vectors. Visual inspection of Table 6 
shows that variables 8 and 10, also head measurements, have very 
high cosines with variable 9 and with each other. Clearly variables 8, 
9, and 10 are variables in a cluster with variable 9 as the first pivot 
vector. Of the remaining variables not in the first cluster, variable 12, 


hand breadth, has the highest weighted invariance value and is the 
second pivot vector therefore. Variables 5, span; and 11, hand length, 
obviously have substantial cosines with variable 12 and constitute 
the second cluster. Even if variable 5 is not included in the cluster, 
this is of no moment because it has a low weighted invariance value 
and hence cannot be a pivot vector for a cluster. The variable with 


the highest weighted invariance value not in the first two clusters is 
variable 2, sitting height, which obviously has a high cosine with 
variable 1, stature, again a variable with a very low weighted invari- 
ance value. The variable with the highest weighted invariance value 
not in the first three clusters is variable 7, chest depth, which clearly 
th variables 3, 4, and 6, which are, respectively, 
and chest preadth. In this instance 
gonal data 


has high cosines wi 
shoulder breadth, hip breadth, 
variables 2, 7, 9, and 12 constitute the most nearly ortho 


basis for the unweighted data vectors. 
It is noticeable that the varimax 8 
vectors, Table 2; the principal axis solution for the weighted vectors, 


matrix M, Table 5; the matrix of cosines on the principal axes of M, 
Table 6; and cosines on the basis of pivot vectors, Table 7; all identify 
the same vectors 28 belonging to clusters even though for Table 7 
basis vectors 2 and 12 have a cosine of — 899. In the weighted data 


rs of the basis serve only to group variables into clusters. 
amental matrix for weighted data vectors; the 


from М, shown in Tables 2, 6, and 7 are 
vectors of M useful only to facilitate inter- 
ectors are not interpretable in factor analytic 
ables. They are not “factors” and in par- 
are not interpretable in factorial terms. 
all lead to the inference that there are four factors, 
trinsic stature, one for girth, one for the extrem- 
These “factors” are, however, inferences 


olution for the weighted data 


bases vecto 
Matrix M is the fund 
other matrices deriving 
rearrangements of the row 
The basis v! 
pothetical vari 
entries 


pretation. 

terms as hy 
ticular, negative 
Tables 2, 6, and Т 
one for long bones or in 
and one for head size. 


d not basis vectors. 
olution, Table 1, the same factors are identified, 


ities, 
or hypotheses an 
In Thurstone's 8 
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but here they are basis vectors as well as inferences. It would seem, 
therefore, that here is an advantage because for example one can infer, 
as Thurstone did, that the extremity factor in part determines stature 
and span, whereas from the tables representing the weighted data 
vectors, it cannot reasonably be inferred that the factor called the 
extremity factor plays a role in determining stature although it can 
be inferred that it is associated with span. Also it can be inferred 
from the simple structure solution that the long bones factor and the 
extremity factor have something to do with span which seems entirely 
reasonable since span is experimentally dependent upon measures of 
the long arm bones, hand length, and shoulder breadth. From the 
weighted data vectors, however, it cannot be inferred that span is in 
part determined by the long bones factor; it seems rather that span is 
largely associated with the extremities factor, which appears unreason- 
able since hand length would seem to constitute a relatively small 
part of the variance of span. 

This discrepancy in results bears close examination since the simi- 
larities between the two varimax solutions for the weighted and un- 
weighted vectors is marked. Consideration of the discrepancies can 
begin by noting that variables 1 and 5 can both be considered as being 
factorially complex and that variable 5 has the highest cosine on the 
first principal axis of the unweighted data vectors while variable 1 
has the third highest cosine; however variable 1 has rather consider- 
ably less of its variance accounted for by the first princip. 
considerably more of its variance accounted for by the 
pal axis than does variable 5. Тһе effect of the weighting is to accen- 
tuate but little the difference of the two variables on the first principal 
axis since the first principal axis receives minimum weight but to 
accentuate more the difference on the second principal axis. Тһе 
rise in cosine on the extremity factor of the variable 5 and the drop in 
cosine of variable 1 on the extremity factor in the varimax solution for 
the weighted data vectors is therefore due in large part to accentua- 
tion of the second component of the first, variable in the unweighted 
principal axes. Thus the cosine of .519 of variable 1, with the extrem- 
ity factor for the unweighted data, is to a large extent determined by 
the second principal axis of the unweighted vectors. For variable 5, 
the rise in cosine is due to relatively low weighting given to the first 
two components of the vector on the principal axes and to the rela- 


al axis and 
second princi- 
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е last two components, especially 


tively high weighting given to th 
the unweighted vari- 


the last. The high cosines for both vectors in 
max solutions derive from different principal axis components. 

| The weighting process then so operates as to cluster variables 
differentially according to their weighted similarity to principal axes. 
The weighting process minimizes successively the first components on 


the principal axes, then the second, ete. Contrariwise it might be 
stated that the weighting process maximizes successively the last 
principal axis components, the next-to-last principal axis compon- 
ents, ete. 

The solution for the unweight 


and second components of both vari 


ed data vectors emphasizes the first 
ables 1 and 5 whereas that for the 
weighted vectors emphasizes the second and third components for 
variable 1 and the last component for variable 5. Thus, when that 
which is common to all variables in 7, „т—7%+1 successive 
orthogonal subspaces js minimized, clustering indicates that as much 
as possible the cluster variables are accounted for by the same single 
component or principal axis. The low cosine for variable 1 on the 
extremity factor for the weighted vectors shows that what is common 
апа most differentiating to variables 12, 11, and 5 is not common to 
The relatively high cosine in the solution for the un- 
cates commonality between the variables due 
onents or to the same com- 
ot be told from the size 
as to limit the value 


r—l.- 


variable 1. 


weighted vectors indi 


cither to different principal axis comp 
ponents, whether it is one or the other cann 
i process 50 operates 


of the cosine. The weighting 
of cosines on the basis of both length and angle whereas the rotations 


of the unweighted vectors limit cosine values only on the basis of 
angular separation. In other words change of basis, starting with the 
principal axes, in th imple structure solution has the effect of dis- 
tributing the latent roots among the basis vectors. The “roots” of 
he principal axes tend toward equality. This 


other bases than t 
increases the variance of squared cosines on basis vectors. Minimiz- 
ing principal axis components with respect to simple structure com- 


nents amounts to minimizing variability in the rows of the trans- 
rix carrying the principal axes to the simple structure 


e si 


po 
formation mat 
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basis. When the data are unweighted this amounts to minimizing 
cosines between simple structure factors and the principal axes. 
Weighting the data vectors gives two ways of minimizing the relation- 
ship between the simple structure factors and the principal axes. The 
two ways of minimizing are rotation and operations on the lengths 
of vectors. 

Inspection of the transformation matrices carrying the principal 
axes of the weighted and unweighted vectors to their respective vari- 
max bases shows that the variability of the rows of cosines, which 
express the principal axes in the new basis, is decidedly lower for the 
weighted than for the unweighted vectors. An index of total relative 
variability for two such matrices can be regarded as 
efficiency of purifying factors. 
zero to unity is 


a measure of 
An index of efficiency on a scale from 


Ба (1) 
r= Or) 

where siis the variance of the ith row of the transformation matrix 
and 1 — (1/7) is the maximal Possible sum of the у 
r X г transformation matrix, 1 — (1/r) is the value attained when 
the transformation matrix is an identity matrix, 

As it is with the factors of factor 
vectors for the vectors of 
If one uses unweighted dat, 
basis are those selected fr 


ariances for the 


analysis, so it is with the basis 
S, the data matrix for classified behavior. 


à vectors, the best r data vectors for a data 
om the pivot vectors ide 
M. These will often, but not alw. 


nearly orthogonal data basis. 
orthogonal data basis will alway: 


ntified from matrix 
ays, be the vectors of the most 


The vectors of the most nearly 
8, however, 
with the unweighted vectors corresponding to 


When clusters of variables are to be 
data basis, the best cluster vectors ар 
As the analysis of the weighted dat. 
hypothesis formation based on vee: 


have very high cosines 
the pivot vectors in M. 


or cosines with more than one b; 
affair and therefore hazardous. 
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TABLE 1* 
VARIMAX SIMPLE STRUCTURE 
Unwercutep Вору MEASURES 


Yi Ve 
1 Stature 08 85 08 52 86 
2 Sitting Height 10 97 19 10 76 
3 Shoulder Breadth 17 30 88 33 33 
4 Hip Breadth 03 15 92 35 55 
5 Span 16 62 26 72 75 
6 Chest Breadth 19 18 94 21 56 
7 . Chest Depth -02 01 99 02 36 
S Head Length 87 18 43 15 38 
9 Head Breadth 99 —05 -04 14 39 
10 Нела Height 98 17 —13 00 35 
u Hand Length 21 50 17 83 82 
12 Hand Breadth -01 08 42 90 60 
* Decimal points are omitted. 
TABLE 2* 


VaRIMAX SIMPLE STRUCTURE 
нтер BODY MEASURES 


WEIG! 

Vs Vs Vs 

1 95 -20 20 
2 91 13 -40 
3 42 03 89 -16 
4 -21 17 97 01 
5 -14 18 -28 93 
6 13 03 96 —25 
т -10 04 92 —37 
8 97 —09 23 —06 
9 97 08 08 —21 
10 98 —03 —19 -08 
11 — 04 —25 —38 89 
12 -28 — 61 — 05 74 


* Decimal points omitted. 
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TABLE 3* 


VARIMAX TRANSFORMATION MATRIX 
(UxwxrGnTED Укстонв) 


* Decimal points omitted. 


TABLE 4* 


Varmax T RANSFORMATION MATRIX 
(СУвлантер У ECTORS) 


* Decimal points omitted, 


TABLE 5 


MATRIX or Coorpinares, 1 


T, or WEIGHTED Data 
Vectors on Pr 


NCIPAL AXES 

Weighted Body М easures Unweighted Weighted 

M, M, M; M, Invariance Invariance 
1 16 104 -о34 1.85 1017 1.62 
2 4.03 4.09 —3.61 1.58 48.44 15.09 
3 2.57 -98 2.55 gg 14.87 4.62 
4 22 1.99 3.4 7 14.47 3.19 
5 -145  —94 _ 59 1.84 6.74 1.46 
6 2.15 2.18 2.96 ва 18.56 5.28 
7 206 376 346 14 30.35 8.52 
8 457  —2.92 1.64 -53 32.35 12.59 
9 6.23 —3.16 32 19 48.91 20.68 
10 4.85 —4.88 =A -09 47.52 17.88 
11 —2.70  —2.70 —:09 178 17.76 5.66 
12 —5.37  —2.96 2.60 1.28 46.43 17.25 
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TABLE 6* 


CosixEs оғ ХҮкіснтер Dara VECTORS 
ох PnINCIPAL AXES 


Г M, Ms Ms М 
1 05 32 —73 56 
2 58 59 —52 23 
3 67 25 66 23 
4 06 52 83 20 
5 -56 —36 —23 7 
6 50 51 69 15 
7 87 68 63 08 
8 80 —51 29 09 
9 89 —45 05 02 

10 7 —71 —06 01 
11 —64 —64 -02 42 
12 —83 —43 40 19 


* Decimal points omitted. 


TABLE 7 
СоѕіхЕ оғ Ргуот VECTORS 
wiTH WEIGHTED Dara VECTORS 
9 3 7 12 
1 20 72 -21 —36 
2 23 1.00 30 —90 
3 43 24 84 —36 
4 —14 —04 90 NE. 
5 —33 —26 —52 66 
6 25 26 96 —39 
7 05 30 1.00 —35 
8 96 04 13 -81 
9 1.00 23 05 -57 
10 95 03 30 —30 
11 -18 —64 —68 88 
12 -57 —90 -85 1.00 
9 1.00 23 05 -57 
2 23 1.00 30 —90 
7 30 1.00 —35 
12 —90 —35 1.00 
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